Joel W. Robbi 
Dietmar A. Salar 


Introduction 
to Differential 
Geometry 


Q) Springer Spektrum 


Springer Studium Mathematik (Master) 


Series Editors 

Heike FaBibender, Institut fiir Numerische Mathematik, Technische Universitat 
Braunschweig, Braunschweig, Germany 

Barbara Gentz, Fakultat fiir Mathematik, Universitat Bielefeld, Bielefeld, Germany 


Daniel Grieser, Institut fiir Mathematik, Carl von Ossietzky Universitat Oldenburg, 
Oldenburg, Germany 

Peter Gritzmann, Zentrum Mathematik, Technische Universitat Miinchen, 
Garching b. Mtinchen, Germany 

Jiirg Kramer, Institut fiir Mathematik, Humboldt-Universitat zu Berlin, Berlin, 
Germany 


Gitta Kutyniok, Mathematisches Institut, Ludwig-Maximilians-Universitat 
Miinchen, Miinchen, Germany 


Volker Mehrmann, Institut fiir Mathematik, Technische Universitat Berlin, Berlin, 
Germany 


Die Buchreihe Springer Studium Mathematik orientiert sich am Aufbau der 
Bachelor- und Masterstudienginge im deutschsprachigen Raum: Neben einer 
soliden Grundausbildung in Mathematik vermittelt sie fachtibergreifende und an- 
wendungsbezogene Kompetenzen. 

Zielgruppe der Lehr- und Ubungsbiicher sind vor allem Studierende und Dozenten 
mathematischer Studienginge, aber auch anderer Fachrichtungen. 

Die Unterreihe Springer Studium Mathematik (Bachelor) bietet Orientierung 
insbesondere beim Ubergang von der Schule zur Hochschule sowie zu Beginn des 
Studiums. Sie unterstiitzt durch ansprechend aufbereitete Grundlagen, Beispiele 
und Ubungsaufgaben und mochte Studierende fiir die Prinzipien und Arbeitsweisen 
der Mathematik begeistern. Titel dieser Reihe finden Sie unter springer.com/series/ 
16564. 

Die Unterreihe Springer Studium Mathematik (Master) bietet studierenden- 
gerechte Darstellungen forschungsnaher Themen und unterstiitzt fortgeschrittene 
Studierende so bei der Vertiefung und Spezialisierung. Dem Wandel des Studi- 
enangebots entsprechend sind diese Biicher in deutscher oder englischer Sprache 
verfasst. Titel dieser Reihe finden Sie unter springer.com/series/16565. 

Die Reihe biindelt verschiedene erfolgreiche Lehrbuchreihen und fiihrt diese 
systematisch fort. Insbesondere werden hier Titel weitergefiihrt, die vorher veréf- 
fentlicht wurden unter: 


Springer Studium Mathematik — Bachelor (springer.com/series/13446) 
Springer Studium Mathematik — Master (springer.com/series/13893) 
Bachelorkurs Mathematik (springer.com/series/13643) 

Aufbaukurs Mathematik (springer.com/series/12357) 

Advanced Lectures in Mathematics (springer.com/series/12121) 


More information about this series at http://www.springer.com/series/16565 


Joel W. Robbin - Dietmar A. Salamon 


Introduction to 
Differential Geometry 


D) Springer 


Joel W. Robbin Dietmar A. Salamon 
University of Wisconsin—Madison ETH Ziirich 
Madison, USA Ziirich, Switzerland 


Springer Studium Mathematik (Master) 
ISBN 978-3-662-64339-6 ISBN 978-3-662-64340-2 (eBook) 
https://doi.org/10.1007/978-3-662-64340-2 


© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer-Verlag GmbH, 
DE, part of Springer Nature 2022 

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part 
of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, 
recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or 
information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar 
methodology now known or hereafter developed. 

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publica- 
tion does not imply, even in the absence of a specific statement, that such names are exempt from the 
relevant protective laws and regulations and therefore free for general use. 

The publisher, the authors and the editors are safe to assume that the advice and information in this book 
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or 
the editors give a warranty, express or implied, with respect to the material contained herein or for any 
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


This Springer Spektrum imprint is published by the registered company Springer-Verlag GmbH, DE part 
of Springer Nature. 
The registered company address is: Heidelberger Platz 3, 14197 Berlin, Germany 


Preface 


These are notes for the lecture course “Differential Geometry I” given by the second 
author at ETH Ziirich in the fall semester 2017. They are based on a lecture course! 
given by the first author at the University of Wisconsin—Madison in the fall semester 
1983. 

One can distinguish extrinsic differential geometry and intrinsic differential ge- 
ometry. The former restricts attention to submanifolds of Euclidean space while the 
latter studies manifolds equipped with a Riemannian metric. The extrinsic theory 
is more accessible because we can visualize curves and surfaces in R°, but some 
topics can best be handled with the intrinsic theory. The definitions in Chap. 2 have 
been worded in such a way that it is easy to read them either extrinsically or in- 
trinsically and the subsequent chapters are mostly (but not entirely) extrinsic. One 
can teach a self contained one semester course in extrinsic differential geometry 
by starting with Chap. 2 and skipping the sections marked with an asterisk such 
as Sect. 2.8. 

Here is a description of the content of the book, chapter by chapter. Chapter | 
gives a brief historical introduction to differential geometry and explains the extrin- 
sic versus the intrinsic viewpoint of the subject.* This chapter was not included in 
the lecture course at ETH. 

The mathematical treatment of the field begins in earnest in Chap. 2, which in- 
troduces the foundational concepts used in differential geometry and topology. It 
begins by defining manifolds in the extrinsic setting as smooth submanifolds of 
Euclidean space, and then moves on to tangent spaces, submanifolds and embed- 
dings, and vector fields and flows.* The chapter includes an introduction to Lie 
groups in the extrinsic setting and a proof of the Closed Subgroup Theorem. It 
then discusses vector bundles and submersions and proves the Theorem of Frobe- 
nius. The last two sections deal with the intrinsic setting and can be skipped at first 
reading. 


' Extrinsic Differential Geometry. 
? It is shown in Sect. 1.3 how any topological atlas on a set induces a topology. 
3 Our sign convention for the Lie bracket of vector fields is explained in Sect. 2.5.7. 
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Chapter 3 introduces the Levi-Civita connection as covariant derivatives of vec- 
tor fields along curves.’ It continues with parallel transport, introduces motions 
without sliding, twisting, and wobbling, and proves the Development Theorem. It 
also characterizes the Levi-Civita connection in terms of the Christoffel symbols. 
The last section introduces Riemannian metrics in the intrinsic setting, establishes 
their existence, and characterizes the Levi-Civita connection as the unique torsion- 
free Riemannian connection on the tangent bundle. 

Chapter 4 defines geodesics as critical points of the energy functional and in- 
troduces the distance function defined in terms of the lengths of curves. It then 
examines the exponential map, establishes the local existence of minimal geodesics, 
and proves the existence of geodesically convex neighborhoods. A highlight of this 
chapter is the proof of the Hopf—Rinow Theorem and of the equivalence of geodesic 
and metric completeness. The last section shows how these concepts and results 
carry over to the intrinsic setting. 

Chapter 5 introduces isometries and the Riemann curvature tensor and proves the 
Generalized Theorema Egregium, which asserts that isometries preserve geodesics, 
the covariant derivative, and the curvature. 

Chapter 6 contains some answers to what can be viewed as the fundamental 
problem of differential geometry: When are two manifolds isometric? The cen- 
tral tool for answering this question is the Cartan—Ambrose—Hicks Theorem, which 
etablishes necessary and sufficient conditions for the existence of a (local) isome- 
try between two Riemannian manifolds. The chapter then moves on to examine flat 
spaces, symmetric spaces, and constant sectional curvature manifolds. It also in- 
cludes a discussion of manifolds with nonpositive sectional curvature, proofs of the 
Cartan—Hadamard Theorem and of Cartan’s Fixed Point Theorem, and as the main 
example a discussion of the space of positive definite symmetric matrices equipped 
with a natural Riemannian metric of nonpositive sectional curvature. 

This is the point at which the ETH lecture course ended. However, Chap. 6 con- 
tains some additional material such as a proof of the Bonnet-Myers Theorem about 
manifolds with positive Ricci curvature, and it ends with brief discussions of the 
scalar curvature and the Wey] tensor. 

The logical progression of the book up to this point is linear in that every chapter 
builds on the material of the previous one, and so no chapter can be skipped except 
for the first. What can be skipped at first reading are only the sections labelled with 
an asterisk that carry over the various notions introduced in the extrinsic setting to 
the intrinsic setting. 

Chapter 7 deals with various specific topics that are at the heart of the subject but 
go beyond the scope of a one semester lecture course. It begins with a section on 
conjugate points and the Morse Index Theorem, which follows on naturally from 
Chap. 4 about geodesics. These results in turn are used in the proof of continuity 
of the injectivity radius in the second section. The third section builds on Chap. 5 
on isometries and the Riemann curvature tensor. It contains a proof of the Myers— 
Steenrod Theorem, which asserts that the group of isometries is always a finite- 


+ The covariant derivative of a global vector field is deferred to Sect. 5.2.2. 
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dimensional Lie group. The fourth section examines the special case of the isometry 
group of a compact Lie group equipped with a bi-invariant Riemannian metric. The 
last two sections are devoted to Donaldson’s differential geometric approach to Lie 
algebra theory as explained in [17]. They build on all the previous chapters and 
especially on the material in Chap. 6 about manifolds with nonpositive sectional 
curvature. The fifth section establishes conditions under which a convex function 
on a Hadamard manifold has a critical point. The last section uses these results to 
show that the Killing form on a simple Lie algebra is nondegenerate, to establish 
uniqueness up to conjugation of maximal compact subgroups of the automorphism 
group of a semisimple Lie algebra, and to prove Cartan’s theorem about the compact 
real form of a semisimple complex Lie algebra. 

The appendix contains brief discussions of some fundamental notions of analysis 
such as maps and functions, normal forms, and Euclidean spaces, that play a central 
role throughout this book. 

We thank everyone who pointed out errors or typos in earlier versions of this 
book. In particular, we thank Charel Antony and Samuel Trautwein for many help- 
ful comments. We also thank Daniel Grieser for his constructive suggestions con- 
cerning the exposition. 


28 August 2021 Joel W. Robbin 
Dietmar A. Salamon 
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What Is Differential Geometry? 


This preparatory chapter contains a brief historical introduction to the subject of dif- 
ferential geometry (Sect. 1.1), explains the concept of a coordinate chart (Sect. 1.2), 
discusses topological manifolds and shows how an atlas on a set determines a topol- 
ogy (Sect. 1.3), introduces the notion of a smooth structure (Sect. 1.4), and outlines 
the master plan for this book (Sect. 1.5). 


1.1 Cartography and Differential Geometry 


Carl Friedrich Gaus (1777-1855) is the father of differential geometry. He was 
(among many other things) a cartographer and many terms in modern differential 
geometry (chart, atlas, map, coordinate system, geodesic, etc.) reflect these ori- 
gins. He was led to his Theorema Egregium (see Theorem 5.3.1) by the question 
of whether it is possible to draw an accurate map of a portion of our planet. Let us 
begin by discussing a mathematical formulation of this problem. 

Consider the two-dimensional sphere S? sitting in the three-dimensional Eu- 
clidean space R?. It is cut out by the equation 


x4 y?+z7=1, 


A map of a small region U C S? is represented mathematically by a one-to-one 
correspondence with a small region in the plane z = 0. In this book we will represent 
this with the notation @ : U > ¢(U) C R? and call such an object a chart or a system 
of local coordinates (see Fig. 1.1). 

What does it mean that @ is an “accurate” map? Ideally the user would want 
to use the map to compute the length of a curve in S*. The length of a curve y 
connecting two points p,q € S? is given by the formula 


1 
Ly) = firwldr. (O=2.0 =o 
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Fig.1.1 A chart 
) 
— 


so the user will want the chart ¢ to satisfy L(y) = L(¢ 0 y) for all curves y. It is a 
consequence of the Theorema Egregium that there is no such chart. 

Perhaps the user of such a map will be content to use the map to plot the shortest 
path between two points p and g in U. This path is called a geodesic and is denoted 
by Ypq- It satisfies L(Ypq) = du (p,q), where 


du (p,q) = inf{L(y) |y@ € U, y(0) = p, yC) = g} 


so our less demanding user will be content if the chart @ satisfies 


du (p,q) = de(h(p), 6(q)), 


where dz(¢(p), d(q)) is the length of the shortest path in the plane. It is also a 
consequence of the Theorema Egregium that there is no such chart. 

Now suppose our user is content to have a map which makes it easy to navigate 
close to the shortest path connecting two points. Ideally the user would use a straight 
edge, magnetic compass, and protractor to do this. S/he would draw a straight line 
on the map connecting p and g and steer a course which maintains a constant angle 
(on the map) between the course and meridians. This can be done by the method 
of stereographic projection. This chart is conformal (which means that it preserves 
angles). According to Wikipedia stereographic projection was known to the ancient 
Greeks and a map using stereographic projection was constructed in the early 16th 
century. Exercises 3.7.5, 3.7.12, and 6.4.22 use stereographic projection; the latter 
exercise deals with the Poincaré model of the hyperbolic plane. The hyperbolic 
plane provides a counterexample to Euclid’s Parallel Postulate. 


Exercise 1.1.1 It is more or less obvious that for any surface M C R° there is a 
unique shortest path in M@ connecting two points if they are sufficiently close. (This 
will be proved in Theorem 4.5.3.) This shortest path is called the minimal geodesic 
connecting p and q. Use this fact to prove that the minimal geodesic joining two 
points p and q in S? is an arc of the great circle through p and q. (This is the 
intersection of the sphere with the plane through p, q, and the center of the sphere.) 
Also prove that the minimal geodesic connecting two points in a plane is the straight 
line segment connecting them. Hint: Both a great circle in a sphere and a line in a 
plane are preserved by a reflection. (See also Exercise 4.2.5 below.) 


Exercise 1.1.2 Stereographic projection is defined by the condition that for p € 
S? \ n the point ¢(p) lies in the xy-plane z = 0 and the three points n = (0,0, 1), 
p, and ¢(p) are collinear (see Fig. 1.2). Using the formula that the cosine of the 
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Fig. 1.2 Stereographic Pro- n 


jection P 
9(p) 


angle between two unit vectors is their inner product prove that @ is conformal. 
Hint: The plane through p, q, and n intersects the x y-plane in a straight line and 
the sphere in a circle through n. The plane through n, p, #(p), and the center of 
the sphere intersects the sphere in a meridian. A proof that stereographic projection 
is conformal can be found in [27, page 248]. The proof is elementary in the sense 
that it doesn’t use calculus. An elementary proof can also be found online at http:// 
people.reed.edu/~jerry/3 | 1/stereo.pdf. 


Exercise 1.1.3. It may seem fairly obvious that you can’t draw an accurate map of 
a portion of the earth because the sphere is curved. However, the cylinder 


C={(x,y2€RI/?4+y=} 


is also curved, but the map yw : R*? > C defined by w(s,t) = (cos(t), sin(t), s) 
preserves lengths of curves, i.e. L(w o y) = L(y) for any curve y : [a,b] > R?. 
Prove this. 


Standard Notations The standard notations N, No, Z, Q, R, C denote respec- 
tively the natural numbers (= positive integers), the non-negative integers, the inte- 
gers, the rational numbers, the real numbers, and the complex numbers. We denote 
the identity map of a set X by idy and the n x n identity matrix by 1, or simply 1. 
The notation V* is used for the dual of a vector space V, but when K is a field such 
as R or C the notation K* is sometimes used for the multiplicative group K \ {0}. 
The terms smooth, infinitely differentiable, and C©™ are all synonymous. 


1.2 Coordinates 


The rest of this chapter defines the category of smooth manifolds and smooth maps 
between them. Before giving the precise definitions we will introduce some termi- 
nology and give some examples. 


Definition 1.2.1. A chart ona set M is a pair (¢,U) where U is a subset of M 
and @: U — (U) is a bijection from U to an open set @(U) in R”. An atlas on 
M is acollection 


A= (ga, Us )}aca 


of charts such that the domains U, cover M,i.e. M = Vee Ava: 
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The idea is that if 6(p) = (x1(p),...,Xm(p)) for p € U, then the functions x; 
form a system of local coordinates defined on the subset U of M. The dimension of 
M should be m since it takes m numbers to uniquely specify a point of U. We will 
soon impose conditions on charts (¢, U), however for the moment we are assuming 
nothing about the maps ¢ (other than that they are bijective). 


Example 1.2.2 Every open subset U C R” has an atlas consisting of a single 
chart, namely (@,U) = (idy, U) where idy denotes the identity map of U. 


Example 1.2.3. Assume that 2 C R” is an open set, that M is a subset of the 
product R” x R" = R”*", and that h : 2 — R" is acontinuous map whose graph 
is M, i.e. 


graph(h) := {(x,y)€ 2x R"|y =h(x)} = M. 


Let U = graph(h) = M and let (x, y) = x be the projection of U onto 2. Then 
the pair (¢, U) is a chart on M. The inverse map is given by 


g(x) = (x, h(x) 
for x € 82 = @(U). Thus M has again an atlas consisting of a single chart. 
Example 1.2.4 The m-sphere 


S” = {p = (x0,...,Xm) E R™* | xg + ty, = 1} 
has an atlas consisting of the 2m + 2 charts ¢;4 : U;z — D” where D” is the 
open unit disk in R”, U;. ={p € S” | +x; > O}, and ¢;~ is the projection which 
discards the ith coordinate. (See Example 2.1.14 below.) 


Example 1.2.5 Let 
A= Al E R@tDxm+) 
be a symmetric matrix and define a quadratic form F : R”*t! > R by 
FO@)=p'Ad, P= Olse5s3%m): 
After a linear change of coordinates the function F has the form 
F(p) = 9 bo + xh — Shp Hp 
(Here r + 1 is the rank of the matrix A.) The set M = F~!(1) has an atlas of 2m +2 
charts by the same construction as in Example 1.2.4, in fact S$” is the special case 


where A = 1,41, the (m+ 1) x (m+ 1) identity matrix. (See Example 2.1.13 below 
for another way to construct charts.) 
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Unit Sphere et+yt2=1 


Ellipsoid ae es 


Cylinder et+y=1 
Elliptic aa 
Hyperboloid st+sa5-aH=l 


(of one sheet) 


Elliptic 
Hyperboloid a +2. ane =] 
(of two sheets) 


Hyperbolic 4 2 
: £ y 
Paraboloid a ZR 
Elliptic 3 5 
: x y 
Paraboloid ZS a + RB 


Fig. 1.3 Quadric Surfaces 


Figure 1.3 enumerates the familiar quadric surfaces in R*. The paraboloids are 
examples of graphs as in Example 1.2.3 with @ = R* andn = 1, and the ellipsoid 
and the two hyperboloids are instances of the quadric hypersurfaces defined in 
Example 1.2.5. The sphere is an instance of the ellipsoid (with a = b = c = 1) 
and the cylinder is a limit of the ellipsoid as well as of the elliptic hyperboloid of 
one sheet (as a = b = 1 and c > oo). The pictures were generated by computer 
using the parameterizations 


x = acos(t) cos(s), y = bsin(t) cos(s), Zz =csin(s) 
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for the ellipsoid, 

x = acos(t) cosh(s), y = bsin(t) cosh(s), z =csinh(s), 
for the elliptic hyperboloid of one sheet, and 

x = acos(t) sinh(s), y = bsin(f) sinh(s), z = +c cosh(s) 


for the elliptic hyperboloid of two sheets. These quadric surfaces will be often used 
in the sequel to illustrate important concepts. 

In the following two examples K denotes either the field R of real numbers or 
the field C of complex numbers, K* := {A € K | A 4 0} denotes the corresponding 
multiplicative group, and V denotes a vector space over K. 


Example 1.2.6 The projective space of V is the set of lines (through the origin) 
in V. In other words, 


P(V) ={€C V |£ isa 1-dimensional K-linear subspace} 


When K = R and V = R”*! this is denoted by RP” and when K = C and 
V =C""*! this is denoted by CP”. For our purposes we can identify the spaces 
C+! and R?”*? but the projective spaces CP” and RP?” are very different. The 
various lines £ € P(V) intersect in the origin, however, after the identification 
P(V)={[v] |v eV \ {0}} with [v] := K*v = Kv \ {0} the elements of P(V) become 
disjoint, i.e. P(V) is the set of equivalence classes of an equivalence relation on the 
open set V \ {0}. Assume that V = K’"*! and define an atlas on P(V) as follows. 
For each integer i = 0,1,...,m define U; := {[v] | v = (xo,...,Xm), X; 4 O} and 
define ¢; : U; — K” by 


gi([v)) = (=... 38), 


Xe oe x: 
This atlas consists of m + 1 charts. 
Example 1.2.7 For each positive integer & the set 

Gi (V) := {€ C V | £is a k-dimensional K-linear subspace} 


is called the Grassmann manifold of k-planes in V. Thus G|(V) = P(V). As- 
sume that V = K” and define an atlas on G;(V) as follows. Let e;,...,e, be the 
standard basis for K”, i.e. e; is the ith column of the n x n identity matrix 1,,. Each 
partition {1,2,...,.m7}=7UJ,1I = {i <---<ig}, J = {i <-++ < jy_x} of the 
first n natural numbers determines a direct sum decomposition K” = V = V; @ V; 
via the formulas 


V; = Ke;, + --- + Ke;,, V7 = Ke;, +--+: + Ke;,_,. 
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Let U; denote the set of all k-planes € € G;,(V) which are transverse to V;, 
i.e. such that £M Vy; = {0}. The elements of U; are precisely those k-planes 
of the form £ = graph(A), where A: V; > V, is a linear map. Define the map 
gr . Uy > K**<-) by the formula 
n—-k 
or (€) = (ays), Aé;, = Yo arsjy. 2) ee 
s=1 


Exercise: Prove that the set of all pairs (¢;, U;) as I ranges over the subsets of 
{1,...,} of cardinality k form an atlas. 


1.3. Topological Manifolds* 


Definition 1.3.1 A topological manifold is a topological space M such that each 
point p € M has an open neighborhood U which is homeomorphic to an open 
subset of a Euclidean space. 


Brouwer’s Invariance of Domain Theorem asserts that, when U C R” and 
V Cc R" are nonempty open sets and ¢ : U > V isahomeomorphism, thenm = n. 
This means that if M is a connected topological manifold and some point of M has 
a neighborhood homeomorphic to an open subset of R”, then every point of M has 
a neighborhood homeomorphic to an open subset of that same IR”. In this case we 
say that M has dimension m or is m-dimensional or is an m-manifold. Brouwer’s 
theorem is fairly difficult (see [24, p. 126] for example) but if ¢ is a diffeomor- 
phism, then the result is an easy consequence of the invariance of the rank in linear 
algebra and the chain rule. (See equation (1.4.1) below.) 

By definition, a topological m-manifold M admits an atlas where every chart 
(¢, U) of the atlas is ahomeomorphism ¢ : U — $(U) from an open set U C M to 
an open set (U) C R’”. The following definition and lemma explain when a given 
atlas determines a topology on M. 


Definition 1.3.2 Let M be a set. Two charts (¢;, U;) and (#2, U2) on M are said 
to be topologically compatible iff ¢;(U; M U2) and ¢2(U; MN U2) are open subsets 
of R” and the transition map 


da = 2 og, 2 @(U; N U2) > d2(U, N U2) 


is ahomeomorphism. An atlas is said to be a topological atlas iff any two charts in 
this atlas are topologically compatible. 


Lemma 1.3.3. Let A = {(¢y, Uw)}aca be an atlas ona set M. 
(i) IfA is a topological atlas, then the collection 


gy(U M Uy) is an open subset of R™ 


U:= 
for everya eA 


(1.3.1) 


ucM| 
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is a topology on M, and with this topology each Uy is an open subset of M 
and each $y is a homeomorphism. Thus M is a topological manifold with the 
topology (1.3.1). 

(i) Jf M is a topological manifold and each Uy is an open set and each dy is a 
homeomorphism, then A is a topological atlas and the topology (1.3.1) coin- 
cides with the topology of M. 


Proof Exercise. Oo 


If M is a topological manifold, then the collection of all charts (U, #) on M such 
that U is open and ¢ is a homeomorphism is a topological atlas. It is the unique 
maximal topological atlas in the sense that it contains every other topological atlas 
as in part (ii) of Lemma 1.3.3. However, we will often consider smaller, even finite, 
topological atlases, and by Lemma 1.3.3 each of these determines the topology 
of M. 


Exercise 1.3.4 Show that the atlas in each example in Sect. 1.2 is a topological 
atlas. Conclude that each of these examples is a topological manifold. 


Any subset S C X of a topological space X inherits a topology from X, called 
the relative topology of S. A subset Up C S is called relatively open in S (or 
S-open) iff there is an open set U C X such that U) = UNM S.A subset Ay C S 
is called relatively closed (or S-closed) iff there is a closed set AC X such 
that Ag = ANS. The relative topology on S is the coarsest topology such that the 
inclusion map S — X is continuous. 


Exercise 1.3.5 Show that the relative topology satisfies the axioms of a topology 
(i.e. arbitrary unions and finite intersections of S-open sets are S-open, and the 
empty set and S' itself are S-open). Show that the complement of an S-open set in 
S is S-closed and vice versa. 


Exercise 1.3.6 Each of the sets defined in Examples 1.2.2, 1.2.3, 1.2.4, and 1.2.5 
is a subset of some Euclidean space R“. Show that the topology in Exercise 1.3.4 is 
the relative topology inherited from the topology of R“. The topology on Ré is of 
course the metric topology defined by the distance function d(x, y) = |x — y|. 


If ~ is an equivalence relation on a topological space X, then the quotient 
space Y := X/~ := {[x]|x € X} is the set of all equivalence classes [x] := {x’ € 
X |x'~ x}. The map z : X — Y defined by z(x) = [x] will be called the obvious 
projection. The quotient space inherits the quotient topology from Y. Namely, a 
set V C Y is open in this topology iff the preimage z~!(V) is open in X. This 
topology is the finest topology on Y such that projection z : X — Y is continuous. 
Since the operation V +> 2~!(V) commutes with arbitrary unions and intersections 
the quotient topology obviously satisfies the axioms of a topology. 
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Exercise 1.3.7 Show that the topology on the projective space P(V) in Exam- 
ple 1.2.6 determined by the atlas is the quotient topology inherited from the open 
set V \ {0}. Express the Grassmann manifold G;(V) in Example 1.2.7 as a quotient 
space and show that the topology determined by the atlas is the quotient topology. 
(Recall that in both examples V = K” with K = R or K = C.) 


1.4 Smooth Manifolds Defined* 


Let U C R" and V C R” be open sets. A map f : U > V is called smooth iff it 
is infinitely differentiable, 1.e. iff all its partial derivatives 


Quite ton f 


Qa Qn? 
Oxy) +++ OXn" 


of = a = (aj,...,,) € NO, 
exist and are continuous. In later chapters we will sometimes write C°(U, V) for 
the set of smooth maps from U to V. 


Definition 1.4.1 Let U C R” and V C R” be open sets. For a smooth map f = 
(fi.---, fm): U — V anda point x € U the derivative of f at x is the linear map 
df (x) : R" — R” defined by 


F(x + t&) — fF) 
t 


d 
Af (x)E = a ; & € R”. 


f(x + t&) = lim 
1=0 t—0 


This linear map is represented by the Jacobian matrix of f at x which will also be 
denoted by 


of; of 
rma ee a 
df (x) := : e€ R™*", 
Ofin Ofin 
a) EO) 


Note that we use the same notation for the Jacobian matrix and the corresponding 
linear map from R” to R”. 


The derivative satisfies the chain rule. Namely, if U CR", V CR”, andW Cc Ré 
are open sets and f : U + V and g: V — W are smooth map, then go f:U > W 
is smooth and 


d(go f)(x) = dg(f(x)) odf(x): R" > R® (1.4.1) 


for every x € U. Moreover the identity map idy : U — U is always smooth and its 
derivative at every point is the identity map of R”. This implies that, if f : U > V 
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is a diffeomorphism (i.e. f is bijective and f and f~! are both smooth), then its 
derivative at every point is an invertible linear map. This is why the Invariance of 
Domain Theorem (discussed after Definition 1.3.1) is easy for diffeomorphisms: if 
f :U —V isa diffeomorphism, then the Jacobian matrix df(x) € R’”*” is invertible 
for every x € U and som = n. The Inverse Function Theorem (see Theorem A.2.2 
in Appendix A.2) is a kind of converse. 


Definition 1.4.2 (Smooth manifold) Let M be a set. A chart on M is a pair 
(f,U) where U C M and ¢ is a bijection from U to an open subset 6(U) C R” 
of some Euclidean space. Two charts (¢;, U;) and (¢2, U2) are said to be smoothly 
compatible iff 6, (U; 0 U2) and ¢2(U, N U2) are both open in R” and the transition 
map 


dr = $206)! + o1(U1 N Ur) > b2(U1 N U2) (1.4.2) 


is a diffeomorphism. A smooth atlas on M is a collection A of charts on M any 
two of which are smoothly compatible and such that the sets U, as (¢, U) ranges 
over the elements of A, cover M (i.e. for every p € M there is a chart (¢,U) € A 
with p € U). A maximal smooth atlas is an atlas which contains every chart which 
is smoothly compatible with each of its members. A smooth manifold is a pair 
consisting of a set M and a maximal smooth atlas A on M. 


Lemma 1.4.3. If A is a smooth atlas, then so is the collection A of all charts 
smoothly compatible with each member of A. The smooth atlas A is obviously max- 
imal. In other words, every smooth atlas extends uniquely to a maximal smooth 
atlas. 


Proof Let (¢;, U;) and (#2, U2) be charts in A and let x € ¢;(U; MN U2). Choose a 
chart (#, U) € A such that $, | (x) € U. Then ¢,(U MU, OM U3) is an open neigh- 
borhood of x in IR” and the transition maps 


$0¢6;):d(UNU NW) > 6(UNU, NW), 
d2°0¢ 7! : (UNUM U2) > o2o(U NU, N U2) 


are smooth by definition of A. Hence so is their composition. This shows that the 
map ¢ ° g, | > bi (U, A U2) > d2(U; A U2) is smooth near x. Since x was chosen 
arbitrary, this map is smooth. Apply the same argument to its inverse to deduce that 
it is a diffeomorphism. Thus A is a smooth atlas. O 


Definitions 1.4.2 and 1.3.2 are mutatis mutandis the same, so every smooth atlas 
ona set M is a fortiori a topological atlas, i.e. every smooth manifold is a topolog- 
ical manifold. (See Lemma 1.3.3.) Moreover the definitions are worded in such a 
way that it is obvious that every smooth map is continuous. 
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Exercise 1.4.4 Show that each of the atlases from the examples in Sect. 1.2 is 
a smooth atlas. (You must show that the transition maps from Exercise 1.3.4 are 
smooth.) 


When A is a smooth atlas on a topological manifold M one says that A is a 
smooth structure on the (topological) manifold M iff A Cc B, where B is the 
maximal topological atlas on M. When no confusion can result we generally drop 
the notation for the maximal smooth atlas as in the following exercise. 


Exercise 1.4.5 Define the notion of a continuous map between topological mani- 
folds and of a smooth map between smooth manifolds via continuity, respectively 
smoothness, in local coordinates. Let M, N, and P be smooth manifolds and 
f :M — WN and g: N > P be smooth maps. Prove that the identity map idjy 
is smooth and that the composition g o f : M — P is a smooth map. (This is of 
course an easy consequence of the chain rule (1.4.1).) 


Remark 1.4.6 It is easy to see that a topological manifold can have many dis- 
tinct smooth structures. For example, {(idg, R)} and {(#, R)} where #(x) = x? are 
atlases on the real numbers which extend to distinct smooth structures but deter- 
mine the same topology. However these two manifolds are diffeomorphic via the 
map x +> x!/3, In the 1950’s it was proved that there are smooth manifolds which 
are homeomorphic but not diffeomorphic and that there are topological manifolds 
which admit no smooth structure. In the 1980’s it was proved in dimension m = 4 
that there are uncountably many smooth manifolds that are all homeomorphic to 
IR* but no two of them are diffeomorphic to each other. These theorems are very 
surprising and very deep. 


A collection of sets and maps between them is called a category iff the col- 
lection of maps contains the identity map of every set in the collection and the 
composition of any two maps in the collection is also in the collection. The sets 
are called the objects of the category and the maps are called the morphisms of the 
category. An invertible morphism whose inverse is also in the category is called an 
isomorphism. Some examples are the category of all sets and maps, the category 
of topological spaces and continuous maps (the isomorphisms are the homeomor- 
phisms), the category of topological manifolds and continuous maps between them, 
and the category of smooth manifolds and smooth maps (the isomorphisms are the 
diffeomorphisms). Each of the last three categories is a subcategory of the preceding 
one. 

Often categories are enlarged by a kind of “gluing process”. For example, the 
“global” category of smooth manifolds and smooth maps was constructed from the 
“local” category of open sets in Euclidean space and smooth maps between them 
via the device of charts and atlases. (The chain rule shows that this local category 
is in fact a category.) The point of Definition 1.3.2 is to show (via Lemma 1.3.3) 
that topological manifolds can be defined in a manner analogous to the definition 
we gave for smooth manifolds in Definition 1.4.2. 
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Other kinds of manifolds (and hence other kinds of geometry) are defined by 
choosing other local categories, i.e. by imposing conditions on the transition maps 
in Equation (1.4.2). For example, a real analytic manifold is one where the transition 
maps are real analytic, a complex manifold is one whose coordinate charts take 
values in C” and whose transition maps are holomorphic diffeomorphisms, and a 
symplectic manifold is one whose coordinate charts take values in R~” and whose 
transition maps are canonical transformations in the sense of classical mechanics. 
Thus CP” is a complex manifold and RP” is a real analytic manifold. 


1.5 The Master Plan 


In studying differential geometry it is best to begin with extrinsic differential geom- 
etry which is the study of the geometry of submanifolds of Euclidean space as in 
Examples 1.2.3 and 1.2.5. This is because we can visualize curves and surfaces in 
I>. However, there are a few topics in the later chapters which require the more ab- 
stract Definition 1.4.2 even to say interesting things about extrinsic geometry. There 
is a generalization to these manifolds involving a structure called a Riemannian met- 
ric. We will call this generalization intrinsic differential geometry. Examples 1.2.6 
and 1.2.7 fit into this more general definition so intrinsic differential geometry can 
be used to study them. 

Since an open set in Euclidean space is a smooth manifold the definition of a 
submanifold of Euclidean space (see Sect. 2.1 below) is mutatis mutandis the same 
as the definition of a submanifold of a manifold. The definitions in Chap. 2 are 
worded in such a way that it is easy to read them either extrinsically or intrinsically 
and the subsequent chapters are mostly (but not entirely) extrinsic. Those sections 
which require intrinsic differential geometry (or which translate extrinsic concepts 
into intrinsic ones) are marked with a *. 


Check for 
updates 


Foundations 


This chapter introduces various fundamental concepts that are central to the fields 
of differential geometry and differential topology. Both fields concern the study of 
smooth manifolds and their diffeomorphisms. The chapter begins with an introduc- 
tion to submanifolds of Euclidean space and smooth maps (Sect. 2.1), to tangent 
spaces and derivatives (Sect. 2.2), and to submanifolds and embeddings (Sect. 2.3). 
In Sect. 2.4 we move on to vector fields and flows and introduce the Lie bracket 
of two vector fields. Lie groups and their Lie algebras, in the extrinsic setting, are 
the subject of Sect. 2.5, which includes a proof of the Closed Subgroup Theorem. 
In Sect. 2.6 we introduce vector bundles over a manifold as subbundles of a trivial 
bundle and in Sect. 2.7 we prove the theorem of Frobenius. The last two sections of 
this chapter are concerned with carrying over all these concepts from the extrinsic 
to the intrinsic setting and can be skipped at first reading (Sects.2.8 and 2.9). 


2.1 Submanifolds of Euclidean Space 


To carry out the Master Plan Sect. 1.5 we must (as was done in [50]) extend the 
definition of smooth map to maps f : X — Y between subsets ¥ C R* and Y c R° 
which are not necessarily open. In this case a map f : X — Y is called smooth iff 
for each xy € X there exists an open neighborhood U C R* of xo and a smooth 
map F : U — R° that agrees with f on UM X. A map f : X > ¥Y is called a 
diffeomorphism iff f is bijective and f and f—! are smooth. When there exists 
a diffeomorphism f : X — Y then X and Y are called diffeomorphic. When X 
and Y are open these definitions coincide with the usage in Sect. 1.4. 


Exercise 2.1.1 (Chain rule) Let X C R“, Y CR‘, Z C R” be arbitrary subsets. 
If f :X — Y and g: Y —> Z are smooth maps, then so is the composition go f : 
X — Z.The identity map id: X — X is smooth. 
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Fig. 2.1 A coordinate chart UnM M 6 Q 

¢:UNM+2 —- ap 
EE» r 

Exercise 2.1.2. Let E C R* be an m-dimensional linear subspace and let vj,..., Um 


be a basis of E. Then the map f : R” > E defined by f(x) := )7yL, xiv; is a 
diffeomorphism. 


Definition 2.1.3 Let k,m € No. A subset M Cc R*é is called a smooth m- 
dimensional submanifold of R“ iff every point p € M has an open neighbor- 
hood U C Ré such that U N M is diffeomorphic to an open subset 2 C R”. A 
diffeomorphism 


¢:UNM>Q2 
is called a coordinate chart of / and its inverse 
W:i=o@':2>UNM 
is called a (smooth) parametrization of U N M (see Fig. 2.1). 


In Definition 2.1.3 we have used the fact that the domain of a smooth map can be 
an arbitrary subset of Euclidean space and need not be open. The term m-manifold 
in R* is short for m-dimensional submanifold of R*. In keeping with the Master 
Plan Sect. 1.5 we will sometimes say manifold rather than submanifold of R* to 
indicate that the context holds in both the intrinsic and extrinsic settings. 


Lemma 2.1.4 If M C R* is anonempty smooth m-manifold, then m < k. 


Proof Fix an element po € M, choose a coordinate chart 6: UN M > Q 
with po € U and values in an open subset 2 C R”, and denote its inverse by 
v:=¢ !:2—+UMM Shrinking U, if necessary, we may assume that ¢ extends 
to asmooth map ® : U — R”’. This extension satisfies ®((x)) = d(W(x)) = x 
and hence d®(w(x))dw(x) = id: R” > R” for all x € Q, by the chain rule. 
Hence the derivative d(x) : R’ — R* is injective for all x € 2, and hence m < k 
because {2 is nonempty. This proves Lemma 2.1.4. Oo 


Example 2.1.5 Consider the 2-sphere 
M :=S? = {x.y 2eER|er+y4+7= 1\ 
depicted in Fig. 2.2 and let U C R? and 2 C R? be the open sets 


U:={(x,y,z)€R|z>0}, W:={(x,y)eR |x? +y? < lh. 
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Fig. 2.2 The 2-sphere and 
the 2-torus >) 


The map ¢?¢: UMM — 2 given by 


P(x, y,Z) = (x,y) 


is bijective and its inverse y := @-! : Q > UN M is given by 


Vx,y) = (x,y, ¥1—x?—y?). 


Since both ¢ and wy are smooth, the map ¢ is a coordinate chart on S*. Similarly, 
we can use the open sets z < 0, y > 0, y < 0, x > 0, x < 0 to cover S* by six 
coordinate charts. Hence S$? is a manifold. A similar argument shows that the unit 
sphere S” Cc R”+! (see Example 2.1.14 below) is a manifold for every integer 
m= 0. 


Example 2.1.6 Let @ C R” be an open set and h : 2 — R‘" be a smooth map. 
Then the graph of h is a smooth submanifold of R” x R‘-” = Ré: 


M := graph(h) := {(x, y) |x € 2, y = h(x)}. 


It can be covered by a single coordinate chart ¢ : UM M > 92, where U := Q x 
R*-”, @ is the projection onto 2, and wy := ¢7! : 2 > U is given by W(x) = 
(x,h(x)) for x € Q. 


Exercise 2.1.7 (The case m = 0) Show that a subset M C R* is a 0-dimensional 
submanifold if and only if M is discrete, i.e. for every p € M there exists an open 
set U C R* such that UN M = {pt}. 


Exercise 2.1.8 (The case m =k) Show that a subset M C R” is an m-dimensional 
submanifold if and only if M is open. 


Exercise 2.1.9 (Products) If M; C R* is an m,-manifold for i = 1,2, show that 
M, x Mz is an (m, + m2)-dimensional submanifold of R*'+2_ Prove that the m- 
torus T’” := (S$!) is a smooth submanifold of C’”. 


The next theorem characterizes smooth submanifolds of Euclidean space. In 
particular condition (iii) will be useful in many cases for verifying the manifold 
condition. We emphasize that the sets Up := U M M that appear in Definition 2.1.3 
are open subsets of M with respect to the relative topology that M inherits from 
the ambient space R* and that such relatively open sets are also called M-open 
(see Sect. 1.3). 
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Fig. 2.3. Submanifolds of U Q 


Euclidean space CNB) Se cS 


Theorem 2.1.10 (Manifolds) Let m andk be integers withO<m<k. Let M CR* 
be a set and po € M. Then the following are equivalent. 


G) There exists an M-open neighborhood Uy C M of po and a diffeomorphism 
9 : Uy Qo onto an open set 29 C R”. 
(ii) There exist open sets U, 2 C R* and a diffeomorphism ¢ : U > Q such that 
Po € U and 


¢(U NM) = 2N(R™ x {03) 


(see Fig. 2.3). 

(iii) There exists an open set U C R* anda smooth map f : U — R*~ such 
that po € U, the derivative df (p) : R‘ > R‘ is surjective for every point 
peUNM, and 


UNM =f '(0)={pEU | f(p) = 9}. 


Moreover, if (i) holds, then the diffeomorphism ¢ : U — Q in (ii) can be chosen 
such that U 1M C Up and ¢(p) = (¢0(p), 9) for every pe UNM. 


Proof First assume (ii) and denote the diffeomorphism in (ii) by 
b = (G1. G2...) US ACR. 
Then part (i) holds with Up := UN M, Qo := {x € R” | (x, 0) € Q}, and 
ho := (f1,.--,dm)|uy : Uo > Qo, 


and part (iii) holds with f := (m41,....0¢) : U > R*&”. This shows that 
part (ii) implies both (i) and (iii). 

We prove that (1) implies (11). Let ¢9 : Up — 20 be the coordinate chart in 
part (i), let Yo:= fo! : Qo — Up be its inverse, and let x9 := ¢o(po) € Qo. 
Then the derivative dyo(xo) : R” — R* is injective by Lemma 2.1.4. Hence 
there exists a matrix B € R‘*“*-™ such that det([dyo(xo) B]) 4 0. Define the 
map y : Qy x R-”" — Ré by 


W(x, y) = Wolx) + By. 


Then the k x k-matrix dy (xo, 0) = [dWo(xo) B] € R*** is nonsingular, by choice 
of B. Hence, by the Inverse Function Theorem A.2.2, there exists an open 
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neighborhood 2 Cox R*—™ of (xo,0) such that U:= V(Q) CRS is open 
and wiz: 2 > UT isa diffeomorphism. In particular, the restriction of y to Q is 
injective. Now the set {x € 29|(x,0) € Qi is open and contains x9. Hence the set 


Uo := {Wolx) | x € Qo, (x,0) € 2} = {p € Up| (Go(p),0) € 2} CM 


is M-open and contains po. Hence, by the definition of the relative topology, there 
exists an open set W C R* such that Uj) = WN M. Define 


U:=UNW, 2:= 20). 


Then UN M = Uv and wy restricts to a diffeomorphism from (2 to U. 
Now let (x, y) € 82. We claim that 


w(x,y)—eM a y =0. (2.1.1) 


If y = 0, then obviously w(x, y) = Wo(x) € M. Conversely, let (x,y) € 2 and 
suppose that p := w(x,y)¢M. Then peEUNM=UNWNM =UoCU 
and hence (¢o(p), 0) € 2, by definition of Uo. This implies 


V (Go(P), 9) = Yolho(p)) = p = W(x, y). 


Since the pairs (x, y) and (¢o(p), 0) both belong to the set Q and the restriction 
of y to Q is injective we obtain x = ¢o(p) and y = 0. This proves (2.1.1). It 
follows from (2.1.1) that the map ¢ := (W|qg)7! : U > @ satisfies ¢(U N M) = 
{(x,y) € 2| W(x, y) € M} = 22 (R” x {0}). Thus we have proved that (i) im- 
plies (ii). 

We prove that (iii) implies (ii). Let f : U > R—" be as in part (iii). Then po € U 
and the derivative df (po) : R“ — R*~” is a surjective linear map. Hence there 
exists a matrix A € R”** such that 


A 
et 0 
ey e 
Define the map ¢ : U > R* by 


| 4? 
b(p) := GA for p € U. 


Then det(d¢(po)) 4 0. Hence, by the Inverse Function Theorem A.2.2, there exists 
an open neighborhood U’ C U of po such that 2’ := $(U’) is an open subset of R* 
and the restriction ¢’ := ¢|y : U’ — @' is a diffeomorphism. In particular, the 
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restriction ¢|y is injective. Moreover, it follows from the assumptions on f and 
the definition of ¢ that 


U'N M ={peU'| f(p) =0} = {p € U' | o(p) € R” x {0}} 


and so ’(U'N M) = 2' 1 (R” x {0}). Hence the diffeomorphism ¢ : U’! > Q’ 
satisfies the requirements of part (ii). This proves Theorem 2.1.10. Oo 


The next corollary relates the notion of a smooth map on a smooth submanifold 
as defined in the beginning of Sect. 2.1 to the standard notion of smoothness in local 
coordinates used in the intrinsic setting of Sect. 2.8 below. 


Corollary 2.1.11 Let M C R* be a smooth m-dimensional submanifold and 
let f : M — R° be amap. Then the following are equivalent. 


(i) For every po € M there exists an open neighborhood U C R* of po and a 
smooth map F : U — R° that agrees with f onU NM. 

(i) If Uo C M is an M-open set and $9 : Up — {2 is a diffeomorhism onto an 
open set 29 C R”, then the composition f o dy! : Qo > R° is smooth. 


Proof Assume (ii), let pp € M, and choose @ = (¢),...,¢¢): U > QC R* as 
in part (ii) of Theorem 2.1.10. Shrinking U, if necessary, we may assume that 
2 = Q) x 2,, where 29 C R” is an open set and £2) C R*—” is an open neigh- 
borhood of the origin. Then the map 2) > R°: xt f og !(x,0) is smooth by 
part (ii). Define F(p) := f o¢!(¢i(p),...,¢m(p),0,...,0) for p € U. Then the 
map F : U — R° is smooth and agrees with f on UM M. Thus f satisfies (i). 
That (i) implies (ii) follows from Exercise 2.1.1 and this proves Corollary 2.1.11. 


Definition 2.1.12 (Regular value) Let U C R* be an open set and let f : U > R¢ 
be a smooth map. An element c € R* is called a regular value of / iff, forall p< U, 
we have 


f(p) =e = df(p) : R* > R° is surjective. 


Otherwise c is called a singular value of f. Theorem 2.1.10 asserts that, if c is a 
regular value of /, then the preimage 


M:= f'(c)={peU| f(p) =e} 
is a smooth (k — £)-dimensional submanifold of R*. 
Examples and Exercises 


Example 2.1.13 Let A = A™ € R*** be a symmetric matrix and define the func- 
tion f : Rk > R by f(x) := x'Ax. Then df(x)& = 2x'AE for x,& € R* and 


2.1 Submanifolds of Euclidean Space 19 


hence the linear map df(x) : R — R is surjective if and only if Ax 4 0. Thus 
c = 0 is the only singular value of f and hence, for every element c € R \ {0}, 
the set M := f—!(c) = {x € R* | x'Ax = c} is a smooth manifold of dimension 
m=k-1. 


Example 2.1.14 (The sphere) As a special case of Example 2.1.13 consider the 
casek = m+ 1, A=1,andc = 1. Then f(x) = lx (? and so we have another proof 
that the unit sphere 


s7 = \x E Rt | |x|? _ 1\ 
in R’”t! is a smooth m-manifold. (See Examples 1.2.4 and 2.1.5.) 


Example 2.1.15 Define the map f : R? x R3 > R by f(x, y) := |x — y|’. This 
is another special case of Example 2.1.13 and so, for every r > 0, the set M := 
{(x, y) € R? x R3| |x — y| =r} is a smooth 5-manifold. 


Example 2.1.16 (The 2-torus) Let 0 <r < 1 and define f : R*? > R by 


(G42 HCP $F a2 $17 =4@ + YC" =P). 


This map has zero as a regular value and M := f~'!(0) is diffeomorphic to the 
2-torus T* = S$! x S!. An explicit diffeomorphism is given by 


eel) by ((1 +rcos(s)) cos(t), (1 + r cos(s)) sin(t), r sin(s)). 
This example corresponds to the second surface in Fig. 2.2. 


Exercise: Show that f(x, y, z) = 0 if and only if (\/x2 + y? + y2—1)? + 2? = r?. Ver- 
ify that zero is a regular value of /. 


Example 2.1.17 The set 
= Lo 72 eee AY) |x,y,z ER, 5g? yr 427 = 1} 


is a smooth 2-manifold in R°. To see this, define an equivalence relation on the unit 
sphere S* C R? by p ~ q iff q = +p. The quotient space (the set of equivalence 
classes) is called the real projective plane and is denoted by 


P? := §$?/{+1}. 
(See Example 1.2.6.) It is equipped with the quotient topology, i.e. a subset U C RP” 
is open, by definition, iff its preimage under the obvious projection S* —> RP? is an 


open subset of S?. Now the map f : S* > R°® defined by 


Jur 2H Fo 2 oe) 
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descends to a homeomorphism from RP? onto M. The submanifold M is covered 
by the local smooth parameterizations 


Q>M: (x,y) f(x,y, V1 — x2 - y?), 
Q—>M:(x,z)% f(x,Vv1—x? — 22,2), 


Q>M:(y,z)% f(V1—y2—- 22, 9,2), 

defined on the open unit disc 2 C R?. We remark the following. 

(a) M is not the preimage of a regular value under a smooth map R® > R*. 

(b) M is not diffeomorphic to a submanifold of R?. 

(c) The projection Y := {(yz, 2x, xy) lx,y,zER, x2-+y?2?4+277= 1} of M 
onto the last three coordinates is called the Roman surface and was discovered 
by Jakob Steiner. The Roman surface can also be represented as the set of 
solutions (€, 7, ¢) € R? of the equation n7¢? + ¢7&? + £7? = En. It is not a 
submanifold of R?. 


Exercise: Prove this. Show that M is diffeomorphic to a submanifold of R*. Show 
that M is diffeomorphic to RP” as defined in Example 1.2.6. 


Exercise 2.1.18 Let V : R” — R bea smooth function and define the Hamiltonian 
function H : R” x R” — R (kinetic plus potential energy) by 


Hx, y) = sbP +00, 
Prove that c is a regular value of H if and only if it is a regular value of V. 
Exercise 2.1.19 Consider the general linear group 
GL(n, R) = {g € R”*”" | det(g) 4 0} 
Prove that the derivative of the function f = det: R’*” — R is given by 
df (g)v = det(g) trace(g~!v) 
for all g € GL(n, R) and v € R”*”. Deduce that the special linear group 
SL(7, R) := {g € GL(v, R) | det(g) = 1} 


is asmooth submanifold of R””*”. 
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Example 2.1.20 The orthogonal group 
O(n) := {g € R"*" | g'g = 1} 

is a smooth submanifold of R”*”. To see this, denote by 

5, 7= {8 eR” |S’ = S} 
the vector space of symmetric matrices and define f : R"*” > &, by 

f(g) := 8's. 
Its derivative df(g) : R’*" — &,, is given by 
df(g)v = g'u+o'g. 


This map is surjective for every g € O(n): if g'g = 1 and S = S" € §,, then the 
matrix v := sgS satisfies 


1 1 1 1 
d = — T _ c = — — TP 2 
If (g)v 58 gS + 5 (85) g a5 + 55 S 


Hence 1 is a regular value of f and so O(7) is a smooth manifold. It has the dimen- 
sion 


n(n+1) _ n(n—1) 


dim O(n) = n? —dim8, =n? — 
2 2 


Exercise 2.1.21 Prove that the set 
M := {(x,y) € R?|xy =0} 


is not a submanifold of R?. Hint: If U C R? is a neighborhood of the origin and 
f :U — Risa smooth map such that U 1 M = f~!(0), then df(0,0) = 0. 


2.2 Tangent Spaces and Derivatives 


The main reason for first discussing the extrinsic notion of embedded manifolds 
in Euclidean space as explained in the Master Plan Sect. 1.5 is that the concept 
of a tangent vector is much easier to digest in the embedded case: it is simply the 
derivative of a curve in M, understood as a vector in the ambient Euclidean space 
in which M is embedded. 
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Fig. 2.4 The tangent space M 


T,M and the translated tan- 
ts T, M +TpM 
gent space p + T, TpM/ y, P: a 
0 


2.2.1 Tangent Space 


Definition 2.2.1 (Tangent vector) Let M Cc R* be a smooth m-dimensional man- 
ifold and fix a point p € M. A vector v € R* is called a tangent vector of M at p 
iff there exists a smooth curve y : R — M such that 


y(0) = p, y(0) =v. 
The set 
T,M := {y(0)|y : R > M is smooth, y(0) = p} 
of tangent vectors of M at p is called the tangent space of M at p. 


Theorem 2.2.3 below shows that 7; M is a linear subspace of R*. As does any 
linear subspace it contains the origin; it need not actually intersect M. Its trans- 
late p + T,M touches M at p; this is what you should visualize for 7,M (see 
Fig. 2.4). 


Remark 2.2.2 Let p € M beas in Definition 2.2.1 and let v € R*. Then 


de > 0 Ay: (—e,¢) > M_ such that 


€T,M => 
. : : is smooth, y(0) = p, y(0) = v. 


To see this suppose that y : (—e,e) > M is a smooth curve with y(0) = p and 
y(0) = v. Define y : R > M by 


eee et 
y(t) := (5), teER. 


Then ¥ is smooth and satisfies 7(0) = p and y(0) = v. Hence v € T,M. 


Theorem 2.2.3 (Tangent spaces) Let M C R* be a smooth m-dimensional man- 
ifold and fix a point p € M. Then the following holds. 


(i) Let Up C M be an M-open set with p € Uo and let do : Up > 20 be 
a diffeomorphism onto an open subset (29 CR”. Let xo := ¢o(p) and 
let Wo := | : 29 > Up be the inverse map. Then 


T,M = im (dyo(xo) : R™ > R*). 
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(ii) Let U,Q2 C R* be open sets and @ : U > & be a diffeomorphism such that 
p € U and ¢(U NM) = 22 (R” x {0}). Then 


T,M = do(p)'(R™ x {0}). 


(iii) Let U C R* be an open neighborhood of p and f : U > R*" be a smooth 
map such that 0 is a regular value of f andU 1M = f7'(0). Then 


T,M = ker df(p). 
(iv) 7,M is an m-dimensional linear subspace of R*. 


Proof Let Wo: 829 > Up and xo € Qo be as in (i) and let 6 : U > Q be as in (ii). 
We prove that 


imdwo(xo) C T,M Cc do(p)'(R” x {0}). (2.2.1) 
To prove the first inclusion in (2.2.1), choose a constant r > 0 such that 
B,(xo) := {x € R” | |x — xo] <r} C Q. 
Now let & € R” and choose € > 0 so small that 
elE| <r. 
Then x9 + t& € QQ for all t € R with |t| < e. Define y : (—e, e) > M by 
y(t) := Wo(xo + té) for —e<t <e. 


Then y is a smooth curve in M satisfying 


d 
v(0) = YWo(xo) = Pp, y(0) = a Wolxo + t&) = do(xo)é. 
=0 


b 


Hence it follows from Remark 2.2.2 that dWo(xo)& € T,M, as claimed. 

To prove the second inclusion in (2.2.1) we fix a vector v € 7, M. Then, by defini- 
tion of the tangent space, there exists a smooth curve y : R — M such that y(0) = p 
and (0) = v. Let U C R* be as in (ii) and choose ¢ > 0 so small that y(t) € U for 
|t| < e. Then 


p(y(t)) € $((U NM) CR™ x {0} 


for |t| < ¢ and hence 


d 
db(p)u = 4o(VO)VQ = 7 | H(y@) € R” x {0}. 
t=0 


This shows that v € dé(p)~!(R” x {0!) and thus we have proved (2.2.1). 
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Now the sets im dyo(xo) and dp(p)~!(R” x {0}) are both m-dimensional linear 
subspaces of R“. Hence it follows from (2.2.1) that these subspaces agree and that 
they both agree with 7, M. Thus we have proved assertions (1), (11), and (iv). 

We prove (iii). Let v € 7, M. Then there is a smooth curve y : R — M such that 
y(0) = pand (0) = v. For ¢ sufficiently small we have y(t) € U, where U Cc R* 
is the open set in (iii), and f(y(t)) = 0. Hence 


d 
df(p)v = df(y0))7(0) = — 


a _ =0. 


t 
This implies 7,M C ker df(p). Since T,M and the kernel of df(p) are both 
m-dimensional linear subspaces of R‘, we deduce that T,M = ker df(p). This 
proves part (iii) and Theorem 2.2.3. O 


Exercise 2.2.4 Let M C R* be a smooth m-dimensional manifold and let p; € M 
be a sequence that converges to a point p € M. Let t; be a sequence of nonzero real 
numbers and let v € R* such that 


: . Pi—P 
lim 7; = 0, lim —— _ = v. 
i—oo i>0 Tj 


Prove that v € 7,M. Hint: Use part (iii) of Theorem 2.2.3. 
Example 2.2.5 Let A = A’ € R‘** be a nonzero matrix as in Example 2.1.13 
and let c 4 0. Then part (iii) of Theorem 2.2.3 asserts that the tangent space of the 
manifold 

M= {x eR! |xTAx = cl 
at a point x € M is the (k — 1)-dimensional linear subspace 


iM = {é eR* |x" = 0}. 


Example 2.2.6 As a special case of Example 2.2.5 with A = 1 and c = 1 we 
find that the tangent space of the unit sphere S” C Rt! at a point x € S$” is the 
orthogonal complement of x. i.e. 


TS” = xt = {é E R11 | (x, &) _— O}. 
Here (x,&) = }°"., x;&; denotes the standard inner product on R™*!, 
Exercise 2.2.7 What is the tangent space of the 5-manifold 


M := {(x,y)€ R?x R?| |x—y| =r} 


at a point (x, y) € M? (See Exercise 2.1.15.) 
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Example 2.2.8 Let H(x,y):= Sly? + V(x) be as in Exercise 2.1.18 and let c be 
a regular value of H.If (x, y) € M := H7!(c), then 


Tx.yyM = {((&.n) € R" x R"| (yn) + (VV(x), €) = OF. 
Here VV := (0V/0x1,...,0V/0X,) : R” — R"” denotes the gradient of V. 
Exercise 2.2.9 The tangent space of SL(n, R) at the identity matrix is the space 
sl(n, R) := T1SL(n, R) = {€ € R””*" | trace(€) = 0} 
of traceless matrices. (Prove this, using Exercise 2.1.19.) 
Example 2.2.10 The tangent space of O(7) at g is 
T,0(n) = {v ER" | glu +vu'g = 0}. 


In particular, the tangent space of O(7) at the identity matrix is the space of skew- 
symmetric matrices 


o(n) := T;O(n) = {€ € R"™" |€7+ E = 0} 


To see this, choose a smooth curve R > O(n): t & g(t). Then g(t)'g(t) = 1 
for all ¢ €¢ R and, differentiating this identity with respect to ¢, we obtain 
g(t)'é(t) + g(t)'g(t) = 0 for every t. Hence every matrix v € T,O(n) satis- 
fies the equation g'v + v'g = 0. With this understood, the claim follows from the 
fact that g'v + v'g = O if and only if the matrix € := g~!v is skew-symmetric and 
that the space of skew-symmetric matrices in R”*” has dimension n(n — 1)/2. 


Exercise 2.2.11 Let 2 C R” be an open set and h : 2 — R*—” be a smooth map. 
Prove that the tangent space of the graph of h at a point (x, A(x)) is the graph of the 
derivative dh(x) : R” > R": 


M = {(x,h(x)) |x € 2}, Tene M = {E, dh(x)8)|€ ER"). 


Exercise 2.2.12 (Monge coordinates) Let M be a smooth m-manifold in R* 
and suppose that p € M is such that the projection 7, M — R” x {0} is invert- 
ible. Prove that there exists an open set 2 C R” and a smooth map h : Q > R‘—” 
such that the graph of / is an M-open neighborhood of p (see Example 2.1.6). Of 
course, the projection 7, M — IR” x {0} need not be invertible, but it must be in- 
vertible for at least one of the (*) choices of the m-dimensional coordinate plane. 
Hence every point of M has an M -open neighborhood which may be expressed as 
a graph of a function of some of the coordinates in terms of the others as in e.g. 
Example 2.1.5. 
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2.2.2 Derivative 

A key purpose behind the concept of a smooth manifold is to carry over the notion 
of a smooth map and its derivatives from the realm of first year analysis to the 
present geometric setting. Here is the basic definition. It appeals to the notion of 
a smooth map between arbitrary subsets of Euclidean spaces as introduced in the 


beginning of Sect. 2.1. 


Definition 2.2.13 (Derivative) Let M C R* be an m-dimensional smooth mani- 
fold and let 


f:M—>R‘ 
be a smooth map. The derivative of f at a point p € M is the map 
df(p):T,M > RB‘ 
defined as follows. Given a tangent vector v € T,M, choose a smooth curve 
y:R-M 
satisfying 
yYO=p, yO) =», 


and define the vector df(p)v € R° by 


SOM-$O) aay 


d 
df (p)v := ai fb YO) = lim 


That the limit on the right in equation (2.2.2) exists follows from our assump- 
tions. We must prove, however, that the derivative is well defined, i.e. that the right 
hand side of (2.2.2) depends only on the tangent vector v and not on the choice of 
the curve y used in the definition. This is the content of the first assertion in the next 
theorem. 


Theorem 2.2.14 (Derivatives) Let MC R* be anm-dimensional smooth manifold 
and f : M — R° bea smooth map. Fix a point p € M. Then the following holds. 


(i) The right hand side of (2.2.2) is independent of y. 
(ii) The map df(p) : T,M —> R° is linear. 
(iii) If N C R° is a smooth n-manifold and f(M) C N, then 


df(p)T,M C TypyN. 
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(iv) (Chain Rule) Let N be as in (iii), suppose that f(M) CN, and let g: N > R¢@ 
be a smooth map. Then 


d(go f)(p) = dg(f(p)) odf(p) : TM > R*. 
(v) If f =id: M > M, thendf(p) = id: T,M > T,M. 
Proof We prove (i). Let v € T,M and y : R > M be as in Definition 2.2.13. By 


definition there is an open neighborhood U C Ré of p andasmooth map F :U > R‘ 
such that 


F(p’) = f(p') forall pp) €e UN M. 


Let dF(p) € R@* denote the Jacobian matrix (i.e. the matrix of all first partial 
derivatives) of F at p. Then, since y(t) €e UN M fort sufficiently small, we have 


dF(p)v = dF(y(0))y(0) 


F(y(t)) 
=0 


dt 


t 


dt 


f(y@)). 
t=0 
The right hand side of this identity is independent of the choice of F while the left 
hand side is independent of the choice of y. Hence the right hand side is also inde- 
pendent of the choice of y and this proves (i). Assertion (ii) follows immediately 
from the identity 


df(p)v = dF(p)v 


just established. 
Assertion (iii) follows directly from the definitions. Namely, if y is as in Defini- 
tion 2.2.13, then B := f oy: R — N isasmooth curve in N satisfying 


B(0) = f(y) = f(p) =:4, ~~ BO) =df(p)v =: w. 


Hence w € 7T,N. Assertion (iv) also follows directly from the definitions. If 
g: N — R¢@ isa smooth map and £, q, w are as above, then 


d(go f)(p)v 


ai| 8 f0O) 


d 
ai|_,g8) 


dg(q)w 
= dg(f(p))df(p)v. 


This proves (iv). Assertion (v) follows again directly from the definitions and this 
proves Theorem 2.2.14. O 
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Corollary 2.2.15 (Diffeomorphisms) Let M C R* be a nonempty smooth m- 
manifold and N C R* be a smooth n-manifold and let f : M — N be a diffeomor- 
phism. Then m = n and the derivative df(p):T,M — Ty p)N is a vector space 
isomorphism with inverse 


df(p)' = df~'(f(p)) : Ty(p)N = T,M 
forall p€ M. 


Proof Define g:= f~!': N > M so that go f = idy and f og = idy. Then 
it follows from Theorem 2.2.14 that, for p € M and q := f(p) € N, we have 
dg(q)odf(p) =id:T,M — T,M and df(p) 0 dg(q) = id: T,N — T,N. Hence 
df(p):T,M — T,N is a vector space isomorphism and its inverse is given by 
dg(q)=df(p)':T,N — T,M. Hence m = n and this proves Corollary 2.2.15. O 


Exercise 2.2.16 Let M Cc R* be a smooth manifold and let f : M > R° bea 
smooth map. Let p; € M be a sequence that converges to a point p € M, let 1; be 
a sequence of nonzero real numbers, and let v € 7, M such that 


Pi7~P _ 


lim 7; = 0, lim v. 
i—0o i—>0oo Tj 
(See Exercise 2.2.4.) Prove that 
I _ aa. 
1>00o T 


L 


Hint: Use the local extension F of f in the proof of Theorem 2.2.14. 


2.2.3. ThelInverse Function Theorem 


Corollary 2.2.15 is analogous to the corresponding assertion for smooth maps be- 
tween open subsets of Euclidean space. Likewise, the inverse function theorem for 
manifolds is a partial converse of Corollary 2.2.15. 


Theorem 2.2.17 (Inverse Function Theorem) Assume that M CR‘ and N CR° 
are smooth m-manifolds and f : M — N is a smooth map. Let po € M and sup- 
pose that the derivative 


df (po) : TM > Ty(p9)N 
is a vector space isomorphism. Then there exists an M -open neighborhood U C M 


of po such that V:= f(U) CN is an N-open subset of N and the restric- 
tion f|y : U > V is a diffeomorphism. 
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Fig.2.5 The Inverse Function Theorem 


Proof Choose coordinate charts $9 : Up > Uo, defined on an M -open neighbor- 
hood Up C M of po onto an open set U 0 CR”, and Ww: Vo => Vos defined on an 
N-open neighborhood Vo Cc N of f(po) onto an open set Vo C R™. Shrinking Up, 
if necessary, we may assume that f(Uo) C Vo. Then the map 


f= Woo f ogy! : Uy > Vo 


(see Fig. 2.5) is smooth and its derivative d f (xo): :R” — R” is biective at 
Xo := do(Po). Hence the Inverse Function Theorem A. 2.2 2 asserts that there exists 
an open neighborhood | U ag Uo of xo such that V := F (Ga ) is an open subset of Vo 
and the restriction of f to U isa diffeomorphism from U to V. Hence the assertion 
holds with U := ¢5'(U) and V := >! (V). This proves Theorem 2.2.17. Oo 


2.2.4 Regular Values 


Definition 2.2.18 (Regular value) Let Mc R* be a smooth m-manifold, 
let N C R¢ be a smooth n-manifold, and let f : M — N be a smooth map. An 
element g € N is called a regular value of f iff, forevery p €¢ M with f(p) = q, 
the derivative df(p) : T,M — TycpyN is surjective. 


Theorem 2.2.19 (Regular values) Let f : M — N be as in Definition 2.2.18 and 
letq € N bea regular value of f. Then the set 


P:=f'@M=WEeM|fDM=9 


is a smooth submanifold of R* of dimension m —n and, for each point p € P, its 
tangent space at p is given by 


T,P =kerdf(p) = {v €T1,M |df(p)v = O}. 


Proof 1 Fix a point po € P and choose a linear map A : Rk + R”~” such that 
the restriction of A to the (m — n)-dimensional linear subspace 


ker df(po) = {v €T,,M |df(po) = o} CT,)M Cc R* 
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is a vector space isomorphism. Define the map F : M > N x R”™" by 


F(p) := (f(p), Ap) 


for p € M. The derivative of F at po is given by dF(po)v = (df(po)v, Av) for 
v € T,,M and is a vector space isomorphism. Hence, by Theorem 2.2.17 there ex- 
ists an M-open neighborhood U C M of po such that V := F(U) is an open subset 
of N x R’”” in the relative topology and the restriction F|y : U > V is a diffeo- 
morphism. Hence the P-open set U M P is diffeomorphic to the open set 2 := 
{y €R”™"|(g,y) € V} CR” by the diffeomorphism ¢ : UN P > 92, defined 
by $(p) := Ap for p € UN P, whose inverse is the smooth map w : 22 > UN P 
given by w(y) = (F|y)7!(q, y) for y € &. This shows that P is a smooth (m —n)- 
manifold in RX. 

Now let p € P and v € T,,P. Then there exists a smooth curve y : R — P such 
that y(0) = p and (0) = v. Since f(y(t)) = q for all t, we have 


d 
df(p)v = on _fY®) =0 


f 
and so v €ker df(p). Hence T, P Cker df(p) and equality holds because both 7, P 


and ker df(p) are (m — n)-dimensional linear subspaces of R*. This proves Theo- 
rem 2.2.19. O 


Proof 2 Here is another proof of Theorem 2.2.19 in local coordinates. Fix a 
point po € P and choose a coordinate chart ¢9 : Up > ¢o(Uo) C R” on an M- 
open neighborhood Up C M of po. Likewise, choose a coordinate chart Wo : Vo > 
Wo(Vo) C R” on an N-open neighborhood Vo C N of qg. Shrinking Up, if necessary, 
we may assume that f(Uo) C Vo. Then the point co := Wo(q) is a regular value of 
the map 


fo:= Woo f 06) | : bo(Uo) > R". 


Namely, if x € fo(Uo) satisfies fo(x) = co, then p := ¢)'(x) € UN P, so the 
maps d¢p'(x):R” > T,M, df(p):T,M >T,N, and dwo(q):T,N > R" 
are all surjective, hence so is their composition, and by the chain rule this compo- 
sition is the derivative dfo(x) : R’” — R”. With this understood, it follows from 
Theorem 2.1.10 that the set 


fy ' (co) = {x € Go(Uo) | f(b ' (x) = a} = G0(Uo N P) 


is a manifold of of dimension m — n contained in the open set ¢9(Up) C R”. Us- 
ing Definition 2.1.3 and shrinking Up further, if necessary, we may assume that the 
set d9(Uy M P) is diffeomorphic to an open subset of R’’~”. Composing this dif- 
feomorphism with ¢o we find that Up M P is diffeomorphic to the same open subset 
of R”~". Since the set Uy C M is M-open, there exists an open set U C R* such 
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that UM M = Up, hence UM P = U)N P, and so Up N P is a P-open neigh- 
borhood of po. Thus we have proved that every element po € P has a P-open 
neigborhood that is diffeomorphic to an open subset of R”~”. Thus P C R* is a 
manifold of dimension m — n (Definition 2.1.3). The proof that the tangent spaces 
of P are given by 7,P = kerdf(p) remains unchanged and this completes the 
second proof of Theorem 2.2.19. Oo 


Definition 2.2.20 Let M CR‘ and N C R* be smooth m-manifolds. A smooth map 
f:M — N iscalledalocal diffeomorphism iff its derivative df(p):T, M — Ty(p)N 
is a vector space isomorphism for every p € M. 


Example 2.2.21 The inclusion of an M-open subset U C M into M and the 
map R > S!:t + e¥ are examples of local diffeomorphisms. 


Exercise 2.2.22 Prove that the image of a local diffeomorphism is an open subset 
of the target manifold. Hint: Use the Inverse Function Theorem. 


In the terminology introduced in Sect. 2.3 and Sect. 2.6.1 below, local diffeo- 
morphisms are both immersions and submersions. In particular, if f : M — N is 
a local diffeomorphism, then every element gq € N is a regular value of f and its 
preimage f—'(q) is a discrete subset of M. 


2.3 Submanifolds and Embeddings 


This section deals with subsets of a manifold M that are themselves manifolds as 
in Definition 2.1.3. Such subsets are called submanifolds of M. 


Definition 2.3.1 (Submanifold) Let M@ C R* be an m-dimensional manifold. A 
subset P C M is called a submanifold of M of dimension n, iff P itself is an 
n-manifold. 


Definition 2.3.2 (Embedding) Let M C R* be an m-dimensional manifold 
and N CR° be an n-dimensional manifold. A smooth map f : N > M is 
called an immersion iff its derivatve df(q): T,N — Tyq)M is injective for 
every q € N. It is called proper iff, for every compact subset K C f(N), the 
preimage f—'(K) = {q € N | f(q) € K} is compact. The map / is called an 
embedding iff it is a proper injective immersion. 


Remark 2.3.3 In our definition of proper maps it is important that the compact 
set K is required to be contained in the image of f. The literature also contains a 
stronger definition of proper which requires that f—!(K) is a compact subset of N 
for every compact subset K C M, whether or not K is contained in the image of 
jf. This holds if and only if the map f is proper in the sense of Definition 2.3.2 and 
has an M -closed image. (Exercise!) 
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Fig. 2.6 A coordinate chart U M o(U) 


adapted to a submanifold CNB o — 


Theorem 2.3.4 (Submanifolds) Let M Cc R* be an m-dimensional manifold and 
N C R° be ann-dimensional manifold. 


Gi) If f : N > M is an embedding, then f(N) is a submanifold of M. 

Gi) If P C M isa submanifold, then the inclusion P — M is an embedding. 

(iii) A subset P C M is a submanifold of dimension n if and only if, for ev- 
ery po € P, there exists a coordinate chart ¢: U — R™” on an M-open 
neighborhood U of po such that 6(U NP) = (VU) A (R” x {0}) (Fig. 2.6). 

(iv) A subset P C M is a submanifold of dimension n if and only if, for ev- 
ery po € P, there exists an M-openneighborhood U C M of po anda smooth 
map g: U — R"™—" such that 0 is a regular value of g and U 0 P = g7'(0). 


The proof is based on the following lemma. 


Lemma 2.3.5 (Embeddings) Let M and N be as in Theorem 2.3.4, let f : NM 
be an embedding, let qo € N, and define 


P:=f(N), — po := f(go) € P. 
Then there exists an M-open neighborhood U C M of po, an N-open neighbor- 


hood V C N of qo, an open neighborhood W C R”™ of the origin, and a diffeo- 
morphism F : V x W > U such that, for allq € V andallz € W, 


F(q,0) = f(q), (2.3.1) 
F(q.z)€P = z=0. (2.3.2) 


Proof Choose any coordinate chart ¢9 : Up > IR” on an M-open neighborhood 
Up C M of po. Then d($o o f)(Go) = 4h0(f(Go)) 2 df (Go) : TaN — R” is in- 
jective. Hence there is a linear map B : R”~" — R” such that the map 

Tg) N x R™™" > R™: (wo) > d(goo f)(qo)w + BE (2.3.3) 
is a vector space isomorphism. Define the set 


2:={(q,.z) EN xR" "| f(g) € Uo, bo(f(q)) + Bz € Go(Uo)}. 


This is an open subset of N x R”~” and we define F : 2 — M by 


F(q,z) := $9 '(Go(f(q)) + Bz). 
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This map is smooth, it satisfies F(q,0) = f(q) for all g € f~!(Up), and the deriva- 
tive dF (qo,0) : Ty,N x R”-" — T,,M is the composition of the map (2.3.3) 
with ddo(po)! : R” — T;,M and so is a vector space isomorphism. Thus 
the Inverse Function Theorem 2.2.17 asserts that there is an N-open neighbor- 
hood Vo C N of go and an open neighborhood Wy) Cc R”™” of the origin such 
that Vo x Wo C &, the set Up := F(Vo x Wo) is M-open, and the restriction of F 
to Vo x Wo is a diffeomorphism onto Up. Thus we have constructed a diffeomor- 
phism F’ : Vo x Wo — Up that satisfies (2.3.1). 

We claim that the restriction of F to the product V x W of sufficiently 
small open neighborhoods V Cc N of qo and W C R”” of the origin also sat- 
isfies (2.3.2). Otherwise, there exist sequences g; € Vo converging to go and 
z; € Wo \ {0} converging to zero such that F(q;,2;) € P. Hence there exists a 
sequence g; € N such that F(q;,z;) = f(q;). This sequence converges to f(qo). 
Since f is proper we may assume, passing to a suitable subsequence if necessary, 
that g; converges to a point gq, € N. Then 


Fo) = Jim f(q;) = Jim F(Gi,21) = (qo). 
Since f is injective, this implies gj = qo. Hence (q},0) € Vo x Wo for i suffi- 


ciently large and F(g;,0) = f(g;) = F(qi,z;). This contradicts the fact that the 
map F' : Vo x Wy > M is injective, and proves Lemma 2.3.5. O 


Proof of Theorem 2.3.4 We prove (i). Let go € N, denote po := f(qo) € P, 
and choose a diffeomorphism F : V x W > U as in Lemma 2.3.5. Then the 
set V C N is diffeomorphic to an open subset of R” (after schrinking V if nec- 
essary), the set UM P is P-open because U C M is M-open, and we have 
UN P = {F(q,0)|q € V} = f(V) by (2.3.1) and (2.3.2). Hence the map 
f:V—-U NO P is a diffeomorphism whose inverse is the composition of the 
smooth maps F-!: UN P > V x W and V x W > V: (q,z)  q. Hence a 
P-open neighborhood of po is diffeomorphic to an open subset of IR”. Since po € P 
was chosen arbitrary, this shows that P is an n-dimensional submanifold of MV. 

We prove (ii). The inclusion 1: P — M is obviously smooth and injective (it 
extends to the identity map on R“). Moreover, T,P C T,M for every p € P and 
the derivative di(p) : T, P — T,M is the obvious inclusion for every p € P. That 
is proper follows immediately from the definition. Hence z is an embedding. 

We prove (iii). If a coordinate chart ¢ as in (iii) exists, then the set UM P is 
P-open and is diffeomorphic to an open subset of IR”. Since the point po € P 
was chosen arbitrary this proves that P is an n-dimensional submanifold of M. 
Conversely, suppose that P is an n-dimensional submanifold of M and let po € P. 
Choose any coordinate chart dp : Up > R” of M defined on an M-open neigh- 
borhood Up C M of po. Then ¢o(Uo NM P) is an n-dimensional submanifold 
of R”. Hence Theorem 2.1.10 asserts that there are open sets V,W C R” with 
Po € V C do(Up) and a diffeomorphism y : V — W such that 


pop EV, WV Ad(UoN P)) = WN (R" x {0}). 
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Fig. 2.7 A proper immersion 


Now define U := $9 '(V) C Up. Then po € U, the chart ¢o restricts to a diffeo- 
morphism from U to V, the composition ¢@ := Wo ¢oly : U > W is a diffeomor- 
phism, and @(U N P) = W(V N do(UoN P)) = WN (R" x {0}). 

We prove (iv). That the condition is sufficient follows directly from Theo- 
rem 2.2.19. To prove that it is necessary, assume that P C M is a submanifold of 
dimension 7, fix an element po € P,, and choose a coordinate chart ¢: U > R” on an 
M-open neighborhood U Cc M of po as in part (iii). Define the map g : U > R™™” 
by g(p) := (¢n41(P),---,@m(P)) for p € U. Then 0 is a regular value of g and 
g!(0) = UN P. This proves Theorem 2.3.4. O 


Exercise 2.3.6 Let M C R* be asmooth m-manifold and @ 4 P C M. 


(i) If P is ann-dimensional submanifold of M, then 0 <n <m. 

(ii) P is a 0-dimensional submanifold of M if and only if P is discrete, i.e. ev- 
ery p € P has an M-open neighborhood U such that UM P = {p}. 

(iii) P is an m-dimensional submanifold of M if and only if P is M-open. 


Example 2.3.7 Let S'C R? = C be the unit circle and consider the map f : S'— R? 
given by f(x, y) := (x, xy). This map is a proper immersion but is not injective 
(the points (0, 1) and (0, —1) have the same image under f). The image f(S!) is a 
figure 8 in R? and is not a submanifold (Fig. 2.7). 

The restriction of f to the submanifold N := S! \ {(0, —1)} is an injective im- 
mersion but it is not proper. It has the same image as before and hence f(1) is not 
a manifold. 


Example 2.3.8 The map f : R — R? given by f(t) := (¢7,t°) is proper and 
injective, but is not an embedding (its derivatuve at tf = 0 is not injective). The 
image of f is the set f(R) = C := {(x, y) € R?|x3 = y*} (see Fig. 2.8) and is 
not a submanifold. (Prove this!) 


Fig. 2.8 A proper injection 
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Example 2.3.9 Define the map f : R > R? by f(t) := (cos(t), sin(t)). This map 
is an immersion, but it is neither injective nor proper. However, its image is the unit 
circle in R? and hence is a submanifold of R?. The map R — R?: t + f(t?) is not 
an immersion and is neither injective nor proper, but its image is still the unit circle. 


2.4 Vector Fields and Flows 


This section introduces vector fields on manifolds (Sect. 2.4.1), explains the flow of 
a vector field and the group of diffeomorphisms (Sect. 2.4.2), and defines the Lie 
bracket of two vector fields (Sect. 2.4.4). 


2.4.1. Vector Fields 


Definition 2.4.1 (Vector field) Let M C R* be asmooth m-manifold. A (smooth) 
vector field on M is a smooth map X : M —> R* such that 


X(p) € T,M 
for every p € M. The set of smooth vector fields on M will be denoted by 


Vect(M) := {xX ‘Mak | X is smooth, X(p) € T,M for all p € M}. 


Exercise 2.4.2 Prove that the set of smooth vector fields on M is a real vector 
space. 


Example 2.4.3 Denote the standard cross product on R? by 


X23 — X32 
XX Y= 1 X3V1 — X13 
X12 — X21 


for x, y € R°. Fix a vector € € S* and define the maps X, ¥ : S* > R? by 


X(p) := & xX p, Y(p) := (& x p) x p. 


These are vector fields with zeros +&. Their integral curves (see Definition 2.4.6 
below) are illustrated in Fig. 2.9. 


Fig.2.9 Two vector fields on 
the 2-sphere | we 
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Fig. 2.10 A hyperbolic fixed 
point 
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Example 2.4.4. Let M := R’. A vector field on M is then any smooth map X : 
IR? — R?. As an example consider the vector field 


X(x,y) := (x,—y). 


This vector field has a single zero at the origin and its integral curves are illustrated 
in Fig. 2.10. 


Example 2.4.5 Every smooth function f : R” — R determines a gradient vector 
field 


X=Vf:=| 0x2 |: R” > R”. 


Definition 2.4.6 (Integral curve) Let M Cc R* be a smooth m-manifold, let X 
be a smooth vector field on M, and let J C R be an open interval. A continuously 
differentiable curve y : J > M is called an integral curve of X iff it satisfies the 
equation y(t) = X(y(t)) for every ¢t € 7. Note that every integral curve of X is 
smooth. 


Theorem 2.4.7. Let M C R* be a smooth m-manifold and X € Vect(M) be a 
smooth vector field. Fix a point po € M. Then the following holds. 


(i) There exists an open interval I C R containing 0 and a smooth curve 
y:I— M satisfying the equation 


y(t) = X(v@)), y(0) = po (2.4.1) 


foreveryt € I. 
Gi) Ify.: 1) > M and yz: In > M are two solutions of (2.4.1) on open intervals 
I, and I containing 0, then y,(t) = yo(t) for every t € I) NI. 
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Fig. 2.11 Vector fields in 


M 
local coordinates U 
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Proof We prove (i). Let ¢9 : Up — R” be a coordinate chart on M, defined 
on an M-open neighborhood Up C M of po. The image of ¢o is an open set 
2 := ¢o(Uo) C R” and we denote the inverse map by Wo := ¢)| : 2 > M (see 
Fig. 2.11). Then, by Theorem 2.2.3, the derivative dyo(x) : R’” —> R* is injective 
and its image is the tangent space Ty,.(x) M for every x € 92. Define f : 2 — R” 
by f(x) := d(x)! X(Wo(x)) for x € 2. This map is smooth and hence, by the 
basic existence and uniqueness theorem for ordinary differential equations in R” 
(see [63]), the equation 


x(t) = f(x), x(0) = Xo := ¢0(Po). (2.4.2) 


has a solution x : J — 92 on some open interval J C R containing 0. Hence the 
function y := Woo x: 1 — Up C M isa smooth solution of (2.4.1). This proves (i). 

The local uniqueness theorem asserts that two solutions y; : J; > M of (2.4.1) 
for i = 1,2 agree on the interval (—s,¢) C 1; N J, for e > O sufficiently small. 
This follows immediately from the standard uniqueness theorem for the solutions 
of (2.4.2) in [63] and the fact that x : J > 2 is a solution of (2.4.2) if and only if 
y := Yoox: I > Up isa solution of (2.4.1). 

To prove (ii) we observe that the set J := J; M Jp is an open interval containing 
zero and hence is connected. Now consider the set 


A:={teT|y(t) = prt}. 


This set is nonempty, because 0 € A. It is closed, relative to J, because the maps y, : 
I — M and y2: I > M are continuous. Namely, if t; € J is a sequence converging to 
t EI, then y;(¢;) = y2(t;) for every i and, taking the limit i > oo, we obtain y; (t) = 
y2(t) and hence t € A. The set A is also open by the local uniqueness theorem. Since 
I is connected it follows that A = J. This proves (ii) and Theorem 2.4.7. Oo 


2.4.2. The Flow of a Vector Field 


Definition 2.4.8 (The flow of a vector field) Let M CR* be a smooth m-manifold 
and X € Vect(M) be a smooth vector field on M. For po € M the maximal exis- 
tence interval of po is the open interval 


I(po) = U}! 


IT C Ris an open interval containing 0 
and there is a solution y : J > M of (2.4.1) 
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By Theorem 2.4.7 equation (2.4.1) has a solution y : (po) > M. The flow of X is 
the map ¢ : D — M defined by 


D := {(t, po) | po € M, t € I(po)} 
and @(t, po) := y(t), where y : [(po) — M is the unique solution of (2.4.1). 


Theorem 2.4.9 Let M C R* be a smooth m-manifold and X € Vect(M) be a 
smooth vector field on M. Let 6: D — M be the flow of X. Then the following 
holds. 


(i) Dis an open subset of R x M. 
(ii) The map ¢: D — M is smooth. 
(iii) Let pp € M and s € I(po). Then 


I((s, po)) = I(po) — 5 (2.4.3) 


and, for every t € R with s +t € I(po), we have 


p(s +t, po) = H(t. P(S, Po)). (2.4.4) 
The proof is based on the following lemma. 


Lemma 2.4.10 Let M, X, D, ¢ be as in Theorem 2.4.9 and let K C M be acom- 
pact set. Then there exists an M-open set U C M andane > 0 such that K C U, 
(—e,€) x U C D, and ¢ is smooth on (—é, €) x U. 


Proof nthe case where M = 2 is an open subset of IR” this is proved in [64, 
Satz 4.1.4 & Satz 4.3.1 & Satz 4.4.1]. Using local coordinates (as in the proof of 
Theorem 2.4.7) we deduce that, for every p € M, there exists an M-open neigh- 
borhood U, C M of p anda constant ¢, > 0 such that (—e,, €,) x U, C D and the 
restriction of ¢ to (—é,, €,) X U, is smooth. Using this observation for every p € K 
(and the axiom of choice) we obtain an M-open cover K C |) U,. Since the 


pEeK “Pp 
set K is compact there exists a finite subcover K C Up, U---U Up, =: U. Now 
define ¢ := min{ép,,...,&p,} to deduce that (—e,e) x U C D and ¢ is smooth 
on (—e, €) x U. This proves Lemma 2.4.10. oO 


Proof of Theorem 2.4.9 We prove (iii). The map y : [(po) — s ~ M defined by 
y(t) := $(s +f, po) is a solution of the initial value problem y(t) = X(y(t)) with 
y(0) = &(s, po). Hence I( po) —s C I(P(s, po)) and equation (2.4.4) holds for every 
t €R with s + ¢ € J(po). In particular, with t = —s we have po = $(—s, #(S, po)). 
Thus we obtain equality in equation (2.4.3) by the same argument with the pair 
(s.po) replaced by (5, $(s, Po). 

We prove (1) and (ii). Let (%, po) € D so that po € M and fo € I(po). Suppose 
to > 0. Then K := {@(t, po) |0 < t < to} is acompact subset of M. (It is the image 
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of the compact interval [0, fo] under the unique solution y : J(po) > M of (2.4.1).) 
Hence, by Lemma 2.4.10, there is an M-open set U C M and an ¢ > 0 such that 


K CU, (-e,e)x U CD, 


and ¢ is smooth on (—e, €) x U. Choose N so large that tg/N < e. Define Up := U 
and, fork = 1,..., N, define the sets Ux C M inductively by 


Oe {p € U | b(to/N, P) e Ux_1}. 


These sets are open in the relative topology of M. 

We prove by induction on k that (—e,kto/N + ©) x Ux C D and ¢ is smooth 
on (—e,kto/N + &) x U;,. For k = 0 this holds by definition of ¢ and U. If k € 
{1,..., N} and the assertion holds for k — 1, then we have 


peu => pe, d(to/N, p) € Ux-1 
= (-6,€) C I(p), (—é,(k — Ito/N + €) C L(b(to/N, p)) 
=> (—e,kto/N +8) C I(p). 


Here the last implication follows from (2.4.3). Moreover, for p € U; and to/N —e < 
t <kto/N + «, we have, by (2.4.4), that 


P(t, p) = o(t — to/N, b(to/N, p)) 


Since @(fo/N, p) € Ux_; for p € U, the right hand side is a smooth map on the 
open set (t9/N — ¢,kto/N + &) x Ux. Since U; C U, ¢ is also a smooth map on 
(—e, €) x U; and hence on (—¢, kto/ N +) x U;. This completes the induction. With 
k = N we have found an open neighborhood of (fo, po) contained in D, namely the 
set (—é, fo + €) x Uy, on which ¢ is smooth. The case fy < 0 is treated similarly. 
This proves (i) and (ii) and Theorem 2.4.9. Oo 


Definition 2.4.11 A vector field X € Vect(M) is called complete iff, for each 
Po € M, there exists an integral curve y : R > M of X with y(0) = po. 


Lemma 2.4.12 Let M C R* be a compact manifold. Then every vector field on M 
is complete. 


Proof Let X © Vect(M). It follows from Lemma 2.4.10 with K = M that there 
exists an € > 0 such that (—e,e) C I(p) for all p € M. By Theorem 2.4.9 this 
implies J(p) = R forall p € M. Hence X is complete. oO 


Let M Cc R* bea smooth manifold and X € Vect(M). Then 


Xiscomplette <=> IJI(p)=RVpeEM <= D=RxM. 
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Assume X is complete, let ¢ : R x M — M be the flow of X, and define the map 
¢':M >M by ¢'(p):= ¢(t, p) fort € R and p € M. Then Theorem 2.4.9 asserts 
that ¢' is smooth for every t € R and that 


gett =¢'o ¢', ¢° = id (2.4.5) 


for all s,t € R. In particular, this implies that 6’ of = ¢' 0 ¢' = id. Hence ¢' 
is bijective and (¢')~' = @~“, so each ¢' is a diffeomorphism. 


Exercise 2.4.13 Let M C R* be a smooth manifold. A vector field X on M 
is said to have compact support iff there exists a compact subset K C M such 
that X(p) = 0 for every p € M \ K. Prove that every vector field with compact 
support is complete. 


We close this subsection with an important observation about incomplete vector 
fields. The lemma asserts that an integral curve on a finite existence interval must 
leave every compact subset of M. 

Lemma 2.4.14 Let M Cc R* be a smooth m-manifold, let X € Vect(M), let 


¢g:D— M be the flow of X, let K C M be a compact set, and let pp € M be an 
element such that 


I(po) 1 [0, 00) = [0,b), 0<b<o. 
Then there exists a number 0 < tx < b such that 
te <t<b => o(t, po) Ee M\ K 


Proof By Lemma 2.4.10 there exists an ¢ > 0 such that (—e, ¢) C I(p) for ev- 
ery p € K. Choose € so small that ¢ < b and define 


tk i= b-—e>0. 
Choose a real number tx < t < b. Then I((t, po)) N [0, co) = [0, b —t) by equa- 


tion (2.4.3) in part (ii) of Theorem 2.4.9. Since 0 < b—t < b—tx = 6, this shows 
that (—e,¢) Z I(P(t, po)) and hence $(t, po) ¢ K. This proves Lemma 2.4.14. O 


The next corollary is an immediate consequence of Lemma 2.4.14. In this for- 
mulation the result will be used in Sect. 4.6 and in Sect. 7.3. 


Corollary 2.4.15 Let MC R* be a smooth m-manifold, let X € Vect(M), and 
let y : (0,7) — M be an integral curve of X. If there exists a compact set K C M 
that contains the image of y, then y extends to an integral curve of X on the inter- 
val (—p, T + p) for some p > 0. 
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Proof Here is another more direct proof that does not rely on Lemma 2.4.10. 
Since K is compact, there exists a constant c > 0 such that |X(p)| <c for 
all p € K. Since y(t) € K for 0 < t < T, this implies 


t 


vy) — y(5)| = fimar < fmlar= fixowplar sets) 


AY 


for 0 < s <¢t < T. Thus the limit pp := lim;\o y(¢) exists in Ré and, since K isa 
closed subset of R«, we have py € K C M. Define yo : [0,T) > M by 


Po; fort = 0, 
tyi= 
volt) an for0 <t <T. 


We prove that yo is differentiable at t = 0 and yo(0) = X(po). To see this, fix a 
constant ¢ > 0. Since the curve [0,7) > R* :t + X(y(t)) is continuous, there 
exists a constant 6 > 0 such that 


0<t<65 = [X(y(t)) — X(po)| < €- 


Hence, for 0 < s < t < 5, we have 


ly(t) — y(s) — @ — 5) X(po)| 


| (GO) =X(p)) dr 


= i; (X(y(r) — X(po)) dr 


IA 


if IX(y(r)) — X(po)| dr 
< (t — se. 


Take the limit s — 0 to obtain 


ro — Po _ (po) = Tim Iv) = vis) = C= S)XPoII — 


> t—s 


for 0 < t <6. Thus yo is differentiable at t = 0 with yo(0) = X(po), as claimed. 
Hence y extends to an integral curve y : (—o,T) > M of X for some p >0 
via Y(t) := (t, po) for —p < t < 0 and Y(t) := y(t) for 0 <t < 7. Here ¢ is 
the flow of X. That y also extends beyond t = T, follows by replacing y(t) 
with y(T —t) and X with —X. This proves Corollary 2.4.15. Oo 
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2.4.3 The Group of Diffeomorphisms 


Let us denote the space of diffeomorphisms of M by 
Diff(M) := {¢: M > M | ¢ is a diffeomorphism}. 


This is a group. The group operation is composition and the neutral element is 
the identity. Now equation (2.4.5) asserts that the flow of a complete vector field 
X € Vect(M) is a group homomorphism 


R > Diff(M) :t ¢’. 


This homomorphism is smooth and is characterized by the equation 
d t t 0 
a? M=XeG), (=P 
for all p € M andt € R. We will often abbreviate this equation in the form 
d t t 0 : 
age =Xo#, ~° = id. (2.4.6) 


Exercise 2.4.16 (Isotopy) Let M C R* be a compact manifold and J C R be an 
open interval containing 0. Let 


IxM>R*:(t, p)b X,(p) 


be a smooth map such that X; € Vect(M) for every ¢. Prove that there is a smooth 
family of diffeomorphisms J x M > M : (ft, p) & ¢,(p) satisfying 


d ' 
ae = X,o¢;, goo = id (2.4.7) 
for every ¢ € J. Such a family of diffeomorphisms 
I > Diff(M): tr ¢, 


is called an isotopy of M. Conversely prove that every smooth isotopy J > 
Diff(M) : t th ¢; is generated (uniquely) by a smooth family of vector fields 
I > Vect(M): th X;. 


2.4.4 The Lie Bracket 
Let M Cc R* and N C R° be smooth m-manifolds and X¥ € Vect(M) be smooth 
vector field on M. If w : N — M is a diffeomorphism, then the pullback of X 


under yy is the vector field on N defined by 


(W*X)(q) = dw(qy 'X(W@) (2.4.8) 
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forg ¢ N.If¢: M — N isa diffeomorphism, then the pushforward of X under 
¢ is the vector field on N defined by 


(¢..X )(q) = db(¢ '(q))X(¢'(@)) (2.4.9) 
forge N. 


Lemma 2.4.17 Let M CR*, N CR‘, and P CR" be smooth m-dimensional sub- 
manifolds, let: M — N andy: N — P be diffeomorphisms, and let X € Vect(M ) 
and Z € Vect(P). Then 


oe X = (6 ')*X (2.4.10) 
and 
(Yo b)aX =WeGiX, (Wool Z =o W*Z. (2.4.11) 
Proof Equation (2.4.10) follows from the fact that 
dg'(q) = dp(-'(q)) 1: TN > Ty1qM 


for all g € N (Corollary 2.2.15) and the equations in (2.4.11) follow directly from 
the chain rule (Theorem 2.2.14). This proves Lemma 2.4.17. Oo 


We think of a vector field on M as a smooth map 
X:M—>R‘* 


that satisfies the condition X(p) € 7,M for every p € M. Ignoring this condition 
temporarily, we can differentiate X as a map from M to R* and its derivative at p 
is then a linear map 


dX(p):T,M > R*. 


In general, this derivative will not take values in the tangent space T,, M. However, 
if we have two vector fields X and Y on M, then the next lemma shows that the 
difference of the derivative of X in the direction Y and of Y in the direction X does 
take values in the tangent spaces of M. 


Lemma 2.4.18 (Lie bracket) Let MC R* be asmooth m-manifold and let X,Y € 
Vect(M) be complete vector fields. Denote by 


R-Diff(M):te¢', R—Diff(M):te Ww 


the flows of X and Y, respectively. Fix a point p € M and define the smooth map 
y:R—-M by 


vit) := og oplod tow (p). (2.4.12) 
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Then y (0) = 0 and 


d Le 
di | ag? 1) = 370 
d| oo. 
Z =| .Y)(p) (2.4.13) 
d t\* 
=5 _() X)(p) 


= dX(p)Y(p) — dY(p)X(p) € 1,M. 


Exercise 2.4.19 Let y : R > R* bea C?-curve and assume (0) = 0. Prove that 
£),_ov(Vi) = lim, so (YO) — yO) = 470. 


Proof of Lemma 2.4.18 Define the map B : R* > M by 
B(s,t) =p ov odp ow (p) 
for s,t € R. Then y(t) = A(t, t) and 


op 


a5 t) = X(p)— dw'(w (py) XW (rp), (2.4.14) 
0 
te 0) = dp’ (h *(p)Y(G * (vp) — YP) (2.4.15) 


for all s,t € R. Hence 


40) = 8B oo 0) = 
V(0) = 5 -(0,0) + = (0,0) = 0. 


This implies the first equality in (2.4.13) by Exercise (2.4.19). To prove the remain- 
ing assertions, note that B(s, 0) = B(0, t) = p, hence the second derivatives 078 /ds7 
and 076 /dt? vanish at s = t = 0, and therefore 


ce mg 
yO)= 2, 0: 0). (2.4.16) 


Combining equations (2.4.15) and (2.4.16) we find 


ap d 
(s,0) = — 
gt de 


0 
WO = = 16° (PY (P)) 
. =0 


Ss 


= |... 
s s=0 
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Likewise, combining equations (2.4.14) and (2.4.16) we find 


af 


1. 
370) 


t 


dt 
d 
dt|, 
d 
dt 


t=0 


0 d 
9B oo, t)=—-— 
10 OS 


FP dy’ (oy (p)) Xv (P)) 
t=0 


dy (w' (p)) XW" (p)) 
=0 

dw'(p) 'X(w'(p)) 
=0 


((w')"X)(p)). 


In both cases the right hand side is the derivative of a smooth curve in the tangent 
space 7, M and so is itself an element of 7, M. Moreover, we have 


SIO = S| dso(y¥O°) 
s=0 
0 AY t —i 
= Os| 9 op og *(p) 
i) AY t =i 
= ala oY oo 
=5| (XO"(p)) — dvr) Xp) 
t=0 
= dX(p)Y(p) — = _o 3s Ra 0 $*(p) 
a t Ss 
= AXP) 5] | Woe) 
a 
= dX(p¥(v)— =] YO") 
s s=0 


= dX(p)Y(p) — dY(p)X(p). 


This proves Lemma 2.4.18. 


E 


Definition 2.4.20 (Lie bracket) Let M Cc R* be a smooth manifold and let 
X,Y € Vect(M) be smooth vector fields on M. The Lie bracket of X and Y is the 
vector field [X, Y] € Vect(M) defined by 


[X, Y](p) := dX(p)¥(p) — dY(p)X(p). 


(2.4.17) 


Warning In the literature on differential geometry the Lie bracket of two vector 
fields is often (but not always) defined with the opposite sign. The rationale behind 
the present choice of the sign will be explained in Sect. 2.5.7. 
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Lemma 2.4.21 Let M c R‘ and N C R® be smooth manifolds, let X,Y, Z be 
smooth vector fields on M, and let 


o¢:N>M 
be a diffeomorphism. Then 
o'[X, Y] = [9° X.¢"Y], (2.4.18) 
[X, Y] + [Y, X] = 0, (2.4.19) 
[X, [Y, Z]] + [Y,[Z, X]] + [Z, LX, Y]] = 0. (2.4.20) 


The last equation is called the Jacobi identity. 
Proof Let R > Diff(M) : tb w’ be the flow of Y. Then the map 
R = Diff(V): tr gloy'od 


is the flow of the vector field ¢*Y on N. Hence, by Lemmas 2.4.17 and 2.4.18, we 
have 


d « 
[p°X, ¢°Y] = ay (pi op'od) *X 
t=0 
d * t\* 
Pale or 
= ¢* [X,Y]. 


This proves (2.4.18). Equation (2.4.19) is obvious. To prove (2.4.20), let o’ be the 
flow of X. Then by (2.4.18) and (2.4.19) and Lemma 2.4.18 we have 


d 
(YZ. X= =] @OW.ZI 
t=0 
d ty\* ty* 
= F|_WooV NZI 


(LY, X], Z] + [¥,[Z, X] 
= [Z,[X, Y]] + [¥, [Z, X]]. 


This proves Lemma 2.4.21. Oo 


Definition 2.4.22 A Lie algebra is a real vector space g equipped with a skew- 
symmetric bilinear map g x g > q: (&, 7) > [E, 7] that satisfies the Jacobi iden- 


tity [§. [n. oN] + [n. (8. €1) + [¢. [§. n]] = 0 for all &,,¢ € g. 
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Example 2.4.23 The Vector fields on a smooth manifold M C R* form a Lie 
algebra with the Lie bracket (2.4.17). The space gl(n, R) = R"*” of real n x n- 
matrices is a Lie algebra with the Lie bracket 


[§,n] <= &n — n&. 


It is also interesting to consider subspaces of g!(n, R) that are invariant under this 
Lie bracket. An example is the space 


o(n) := {€ € gl, R)|€"+ & = 0} 


of skew-symmetric 1 x n-matrices. It is a nontrivial fact that every finite-dimensional 
Lie algebra is isomorphic to a Lie subalgebra of gl(n, IR) for some n. For example, 
the cross product defines a Lie algebra structure on R? and the resulting Lie algebra 
is isomorphic to 0(3). 


Remark 2.4.24 There is a linear map R”*” — Vect(IR”) : § > X¢ which assigns 
to a matrix € € gl(m, R) the linear vector field X~¢ : R” — R’” given by X¢(x):=&x 
for x € R”. This map preserves the Lie bracket, i.e. [X¢, X,] = X{¢,7], and hence is 
a Lie algebra homomorphism. 


To understand the Lie bracket geometrically, consider again the curve 
y(t) =p oy op! ow "(p) 


in Lemma 2.4.18, where ¢’ and w’ are the flows of the vector fields X and Y, 
respectively. Since y(0) = 0, Exercise 2.4.19 asserts that 


1 d 
X.YI(p) = 57O= | boy og Moy (py). (2.4.21) 
t=0 


Geometrically this means that by following first the backward flow of Y for time 
€, then the backward flow of X for time ¢, then the forward flow of Y for time e¢, 
and finally the forward flow of X for time ¢, we will not, in general, get back to the 
original point p where we started but approximately obtain an “error” ¢?[X, Y](p). 
An example of this (which we learned from Donaldson) is the mathematical formu- 
lation of parking a car. 


Example 2.4.25 (Parking a car) The configuration space for driving a car in the 
plane is the manifold M := C x S!, where S! C C denotes the unit circle. Thus a 
point in M isa pair p = (z,A) € C x C with |A| = 1. The point z € C represents 
the position of the car and the unit vector A € S! represents the direction in which 
it is pointing. The /eft turn is represented by a vector field X and the right turn 
by a vector field Y on M. These vector field are given by X(z,A) := (A, iA) and 
Y(z,A) := (A, —id). Their Lie bracket is the vector field [X, Y](z, 4) = (—2iA, 0). 
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This vector field represents a sideways move of the car to the right. And a sideways 
move by 2¢? can be achieved by following a backward right turn for time ¢, then 
a backward left turn for time ¢, then a forward right turn for time ¢, and finally a 
forward left turn for time e. 

This example can be reformulated by identifying C with R? via z = x + iy and 
representing a point in the unit circle by the angle 9 € R/27Z via A = e’®. In this 
formulation the manifold is M = R?* x R/2zZ, a point in M is represented by a 
triple (x, y, 9) € R°, the vector fields X¥ and Y are 


X(x, y, 0) := (cos(), sin(@), 1), Y(x, y, 9) := (cos(@), sin(@), —1), 
and their Lie bracket is [X, Y](x, y, @) = 2(sin(@), —cos(@), 0). 


Lemma 2.4.26 Let X,Y € Vect(M) be complete vector fields on a manifold M 
and ¢',w' € Diff(M) be the flows of X and Y, respectively. Then the Lie bracket 
[X, Y] vanishes if and only if the flows of X and Y commute, i.e. ¢§ ow! = y' o * 
foralls,t ER. 


Proof If the flows of X and Y commute, then the Lie bracket [X, Y] vanishes by 
Lemma 2.4.18. Conversely, suppose that [X, Y] = 0. Then we have 


d d 
qe Pal =e] PDa¥ = [XY] = 0 
s dr |,—0 


for every s € R and hence 
(¢*).¥ = Y. (2.4.22) 


Fix a real number s and define the curve y: R —> M by y(t) := ¢*(wW'(p)) 
fort € R. Then y(0) = ¢*(p) and 


V(t) = db (v' (Pp) YW" (P) = (4). Y(VO) = YO) 


for all t. Here the last equation follows from (2.4.22). Since w’ is the flow of Y we 
obtain y(t) = w'(#*(p)) for all t € R and this proves Lemma 2.4.26. Oo 


Exercise 2.4.27 In the situation of Lemma 2.4.26 prove that {¢' o w'},ep is the 
flow of the vector field X + Y. 


2.5 Lie Groups 


Combining the concept of a group and a manifold, it is interesting to consider 
groups which are also manifolds and have the property that the group operation 
and the inverse define smooth maps. We shall only consider groups of matrices. 
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2.5.1 Definition and Examples 


Definition 2.5.1 (Lie group) A nonempty subset G C R”*” is called a Lie group 
iff it is a submanifold of R”*” and a subgroup of GL(n, R), i.e. 


g heG => gheG 
(where gh denotes the product of the matrices g and /) and 
geG = det(g) 4 Oandg '€G. 


(Since G ¥ G it follows from these conditions that the identity matrix 1 is an element 
of G.) 


Example 2.5.2 The general linear group G = GL(n, R) is an open subset of R”*” 
and hence is a Lie group. By Exercise 2.1.19 the special linear group 


SL(n, R) = {g € GL(v, R) | det(g) = 1} 
is a Lie group and, by Example 2.1.20, the special orthogonal group 
SO(n) := {g € GL(n,R)| g'g = 1, det(g) = 1} 


is a Lie group. In fact every orthogonal matrix has determinant +1 and so SO(7) is 
an open subset of O(7) (in the relative topology). 

In a similar vein the group GL(n, C) := {g € C”*" | det(g) 4 0} of complex 
matrices with nonzero (complex) determinant is an open subset of C””” and hence 
is a Lie group. As in the real case, the subgroups 


SL(a, C) := {g € GL(n, C) | det(g) = tt, 
U(n) := {g € GLa, C)|g*g = 1}, 
SU(n) := {g € GL(n, C) | g*g = 1, det(g) = 1} 


are submanifolds of GL(n, C) and hence are Lie groups. Here g* := @' denotes the 
conjugate transpose of a complex matrix. 


Exercise 2.5.3. Prove that SL(n, C), U(7), and SU(7) are Lie groups. Prove that 
SO(n) is connected and that O(7) has two connected components. 


Exercise 2.5.4 Prove that GL(n, C) can be identified with the group 


G := {® € GL(2n,R) | ®Jy = Jo®}, Jo:= (‘ =) 
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Hint: Use the isomorphism R” x R” > C”: (x, y) B® x +1y. Show that a matrix 
® € R*”*?" commutes with Jo if and only if it has the form 


od es ay X,Y eR". 
¥ ¥ 


What is the relation between the real determinant of ® and the complex determinant 
of X + iY? 


Exercise 2.5.5 Let Jo be as in Exercise 2.5.4 and define 
Sp(2n) := {W € GL(2n,R) | YW Jo = Jo}. 


This is the symplectic linear group. Prove that Sp(2) is a Lie group. Hint: 
See [49, Lemma 1.1.12]. 


Example 2.5.6 (Unit quaternions) The quaternions form a four-dimensional as- 
sociative unital algebra HI, equipped with a basis 1,i,j,k. The elements of H are 
vectors of the form 


X=Xo tix, +jx2+ kx3 X09, X1,X%2,x%3 ER. (2.5.1) 


The product structure is the bilinear map H x H — H: (x, y) & xy, determined 
by the relations 


P=f=kP=-1, ij=ji=k, jk=—-kj=i, ki=—ik =j. 


This product structure is associative but not commutative. The quaternions are 
equipped with an involution H — H : x + x, which assigns to a quaternion x of 
the form (2.5.1) its conjugate x := xo — ix; — jx2 — kx3. This involution satisfies 
the conditions 


xPy=i+5, yayx, ex=[xh “lol —ixilyl 


for x, y € H, where |x| := Ae + x} + x3 +x} denotes the Euclidean norm of 
the quaternion (2.5.1). Thus the unit quaternions form a group 


Sp(1) := {x € H||x| = 1} 


with the inverse map x > x. Note that the group Sp(1) is diffeomorphic to the 3- 
sphere S* C R¢ under the isomorphism H = R*. Warning: The unit quaternions 
(a compact Lie group) are not to be confused with the symplectic linear group in 
Exercise 2.5.5 (a noncompact Lie group) despite the similarity in notation. 
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Let G C GL(v, R) be a Lie group. Then the maps 
GxG—>G:(g,h)b gh, GoG:grhg! 


are smooth (see [64]). Fixing an element 4 € G we find that the derivative of the 
map G > G: gb ghat g € Gis given by the linear map 


TzG > TynG : > Bh. (52) 


Here g and / are both matrices in R"*” and gh denotes the matrix product. In fact, 
ifg e T,G, then, since G is a manifold, there exists a smooth curve y : R > Gwith 
y(0) = g and y(0) = g. Since G is a group we obtain a smooth curve B : R > G 


given by B(t) := y(t)h. It satisfies B(0) = gh and so gh = B(0) € T,pG. 

The linear map (2.5.2) is obviously a vector space isomorphism whose inverse 
is given by right multiplication with h~!. It is sometimes convenient to define the 
map R;, : G— Gby 


Ri(g) = gh 


for g € G (right multiplication by h). This is a diffeomorphism and the linear 
map (2.5.2) is the derivative of Rj, at g, so 


dRp(g)g = eh for g € T,G. 
Similarly, each element g ¢ G determines a diffeomorphism L, : G > G, given by 
Lg(h) := gh 


for h € G (left multiplication by g). Its derivative at h € G is again given by matrix 
multiplication, i.e. the linear map dLg(h) : T,G — TG is given by 


dL,(hyh = gh  forh € 7,G. (2.5.3) 


Since L, is a diffeomorphism its derivative dL ,(h) : T,G — T,nG is again a vector 
space isomorphism for every h € G. 


Exercise 2.5.7 Prove that the map G > G: g+> g7! is a diffeomorphism and that 
its derivative at g € G is the vector space isomorphism 


T,G > T,1Giute> =p ‘up, 


Hint: Use [64] or any textbook on first year analysis. 
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2.5.2 The Lie Algebra of a Lie Group 
Let 

Gc GL(n,R) 


be a Lie group. Its tangent space at the identity matrix 1 € G is called the Lie 
algebra of G and will be denoted by 


g = Lie(G) := 71G. 


This terminology is justified by the fact that q is in fact a Lie algebra, i.e. it is 
invariant under the standard Lie bracket operation 


[§,n] <= &n — & 


on the space R”*” of square matrices (see Lemma 2.5.9 below). The proof requires 
the notion of the exponential matrix. For § € R”*” and t € R we define 


Oo rk gk 
exp(té) := )> ae (2.5.4) 
k=0 
A standard result in first year analysis asserts that this series converges absolutely 
(and uniformly on compact t-intervals), that the map 


R— R"" : t & exp(té) 


is smooth and satisfies the differential equation 


d 
a exp(t&) = & exp(t&) = exp(té)é, (2.5.5) 
and that 


exp((s + 1)§) = exp(s&)exp(té), — exp(0&) = 1 (2.5.6) 


for all s,t € R. This shows that the matrix exp(t&) is invertible for each ¢ and that 
the map R > GL(n, R) : t  exp(t&) is a group homomorphism. 


Exercise 2.5.8 Prove the following analogue of (2.4.12). For €, € g 


d 

a7|_ xPOVt8) exp(o/tn) exp(— V1) exp(—vin) = [§,n]. (2577) 
t=0 

In other words, the infinitesimal Lie group commutator is the matrix commutator. 


(Compare Equations (2.5.7) and (2.4.21).) 
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Lemma 2.5.9 Let GC GL(n, R) be a Lie group and denote by g := Lie(G) its Lie 
algebra. Then the following holds. 


(i) If& € gq, then exp(t&) € Gfor everyt € R. 
(ii) If g € Gandy € q, then gng' € g. 
(iti) If. € g, then [§,n] = En—né € g. 


Proof We prove (i). For every g € G we have a vector space isomorphism g = 
TG > T,G :& + &g as in (2.5.2), Hence each element & € g determines a vector 
field X; € Vect(G), defined by 


X;(g) := Eg € T,G, géeG. (2.5.8) 
By Theorem 2.4.7 there is an integral curve y : (—é, €) > G satisfying 
yVO= XO) =Ev®, yvO)=1. 
By (2.5.5), the curve (—e, ¢) > R"*" : t & exp(té&) satisfies the same initial value 
problem and hence, by uniqueness, we have exp(t&) = y(t) € G forall t € R with 


|t| < e. Now let t € R and choose N € N such that |4| < e. Then exp(&) € G 
and hence it follows from (2.5.6) that 


4 Ae 
exp(t&) = exo( <8) eG. 


This proves (1). 
We prove (ii). Consider the smooth curve y : R > R”*” defined by 


y(t) = gexp(tn)g'. 
By (i) we have y(t) € G for every ¢ € R. Since y(0) = 1 we have 
gng' = (0) €g. 


This proves (ii). 
We prove (iii). Define the smooth map 7 : R > R”*” by 


n(t) := exp(t&)n exp(—té). 


By (i) we have exp(t&) € G and, by (ii), we have n(t) € g for every t € R. Hence 
[E, |] = (0) € g. This proves (iii) and Lemma 2.5.9. Oo 


By Lemma 2.5.9 the curve y : R > G defined by y(t) := exp(t&)g is the integral 
curve of the vector field X¢ in (2.5.8) with initial condition y(0) = g. Thus X¢ is 
complete for every & € g. 
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Lemma 2.5.10 If& € g and y : R > Gis a smooth curve satisfying 
yst+O=yov©, rvO=1 yvO)=é, (2.5.9) 
then y(t) = exp(t&) for every t € R. 


Proof For every t € R we have 


y(t) = : = 
yu) = ds a > As 


s= 


y(s)y@) = yO)y(@) = Ev@). 
=0 


Ss 


Hence y is the integral curve of the vector field X¢ in (2.5.8) with y(0) = 1. This 
implies y(t) = exp(t&) for every ¢ € R, as claimed. Oo 


Example 2.5.11 Since the general linear group GL(n, R) is an open subset of 
R”*" its Lie algebra is the space of all real n x n-matrices 


gl(n, R) := Lie(GL(n, R)) = R"”. 
The Lie algebra of the special linear group is 
sl(n, R) := Lie(SL(n, R)) = {é € gl(n, R) | trace(é) = o} 
(see Exercise 2.2.9) and the Lie algebra of the special orthogonal group is 
s0(n) := Lie(SO()) = {é € gl(n, R) | gles o} = o(n) 
(see Example 2.2.10). 


Exercise 2.5.12 Prove that the Lie algebras of the general linear group over C, 
the special linear group over C, the unitary group, and the special unitary group are 
given by 
gl(n, C) := Lie(GL(n, C)) = C”*”, 
sl(n, C) := Lie(SL(v, C)) = {é Ee gl(n,C) | trace(é) = o}, 
u(n) := Lie(U(n)) = {& € gl(n, R) | &* + € = Of, 
su(n) := Lie(SU(n)) = {& € gl(n,C) | &* + € = 0, trace(E) = 0}. 


These are vector spaces over the reals. Determine their real dimensions. Which of 
these are also complex vector spaces? 


Remark 2.5.13 Let G C GL(n, R) be a subgroup. In Theorem 2.5.27 below it 
is shown that G is a Lie group if and only if it is a closed subset of GL(7, R) in 
the relative topology. This observation can be used in many of the examples and 
exercises of the present section. 
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Exercise 2.5.14 Let V be a finite-dimensional vector space. Prove that the vector 
space g := V x End(V) is a Lie algebra with the Lie bracket 


[(u, A), (v, B)] = (Av — Bu, AB — BA) (2.5.10) 


for u,v € V and A, B € End(V). Find the corresponding Lie group. Find an em- 
bedding of g into End(R x V) as a Lie subalgebra. 


Exercise 2.5.15 Let (V,w) be a 2n-dimensional symplectic vector space, so 
@:VxV—R 1s a nondegenerate skew-symmetric bilinear form. The Heisen- 
berg algebra of (V,@) is the Lie algebra h := V x R with the Lie bracket of two 
elements (v,t), (v’,t') € V x R defined by 


[(v, 0), (v', t!)] = (0, e(v, v’)). (2.5.11) 


Find a corresponding Lie group structure on H= V x R. Embed H as a Lie subgroup 
into GL(n + 2,R) and find a formula for the exponential map. Hint: Take V = 
R” x R” and w((x, y), (x, y’)) = (x, y’) —(y, x’) and define (x, y,t)-(x’, yt) := 
+x yty tte + (x,y). 


2.5.3. Lie Group Homomorphisms 


Let G, H be Lie groups and g, § be Lie algebras. A Lie group homomorphism 
from G to H is a smooth map p : G — H that is a group homomorphism. A Lie 
group isomorphism is a bijective Lie group homomorphism whose inverse is also 
a Lie group homomorphism. A Lie group automorphism is a Lie group isomor- 
phism from a Lie group to itself. A Lie algebra homomorphism from g to § is a 
linear map ® : g — h that preserves the Lie bracket. A Lie algebra isomorphism 
is a bijective Lie algebra homomorphism whose inverse is also a Lie algebra homo- 
morphism. A Lie algebra automorphism is a Lie algebra isomorphism from a Lie 
algebra to itself. 


Lemma 2.5.16 Let G and H be Lie groups and denote their Lie algebras 


by g := Lie(G) and § := Lie(H). Let p: G > H be a Lie group homomorphism 
and denote its derivative at 1 € G by 


p:=dp(1):g > 5. 
Then p is a Lie algebra homomorphism. Moreover, 


p(exp(&)) = exp(p(&)), p(gég”') = p(g)p(E)p(g) | 


forallé € gandall g €G. 
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Proof The proof has three steps. 
Step 1 For all — € g andt € R we have p(exp(té)) = exp(tp(&)). 


Fix an element € € q. Then exp(t&) € G for every t € R by Lemma 2.5.9. Thus we 
can define a curve y : R > H by y(t) := p(exp(té)). Since p is smooth, this is a 
smooth curve in H and, since p is a group homomorphism and the exponential map 
satisfies (2.5.6), our curve y satisfies the conditions 


ystH=yov@Q, yvO=1, yO) =dpWé = p€). 
Hence y(t) = exp(t6(&)) by Lemma 2.5.10. This proves Step 1. 
Step 2 For all g € Gand n € g we have p(gng™') = p(g)p(n)p(g)!. 


Define the smooth curve y : R > G by y(t) := g exp(tn)g~!. It takes values in G 
by Lemma 2.5.9. By Step | we have 


p(y(t)) = p(g)e(exp(tn))p(g)' = p(g) exp(tp(n))p(g) | 


for every t. Since y(0) = 1 and y(0) = gng™! we obtain 
A(gng') = dp(v(0))70) 


ply (t)) 
=0 


dt 


t 


dt|, 
p(g)p(n)p(g) ’. 


p(g) exp(tp(n))p(g) | 
=0 


This proves Step 2. 


Step 3 For all &,n € g we have p([&,n]) = [6(€), OI. 


Define the curve n : R > g by n(t) := exp(t£)n exp(—té) fort € R. It takes values 
in the Lie algebra of G by Lemma 2.5.9 and (0) = [&, n]. Hence 


d 
p(lé. nl) = p(exp(té)n exp(—té)) 
0 


t= 


a 

2 “ p(exp(t£)) (n)o(exp(—18)) 
t\+=0 
d 


= a, |. expo) on) exp(—-t6(&)) 
t=0 


[6(E), o(n)]. 
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Here the first equality follows from the fact that p is linear, the second equality 
follows from Step 2 with g = exp(t&), and the third equality follows from Step 1. 
This proves Step 3 and Lemma 2.5.16. O 


Exercise 2.5.17 A Lie group homomorphism p:G-— H is uniquely deter- 
mined by the Lie algebra homomorphism (6 whenever G is connected. Hint: 
If 01, P2 : G > Hare Lie group homomorphisms such that 6; = (2, prove that the 
set A := {g € G| p1(g) = 2(g)} is both open and closed. 


Exercise 2.5.18 If 6: g — 6 is a bijective Lie algebra homomorphism, then its 
inverse is also a Lie algebra homomorphism. 


Exercise 2.5.19 If 0: G— H is a bijective Lie group homomorphism, then 
p-':H—G is smooth and hence p is a Lie group isomorphism. Hint: Use 
Lemma 2.5.16 to prove that 6 : g — h is injective. If # is not surjective, show that p 
has no regular value in contradiction to Sard’s theorem. 


Example 2.5.20 The complex determinant defines a Lie group homomorphism 
det : U(n) > S!. The associated Lie algebra homomorphism is 


trace = det : u(n) > iR = Lie(S!). 
Example 2.5.21 (Unit quaternions and SU(2)) The Lie group SU(2) is diffeo- 
morphic to the 3-sphere. Every matrix in SU(2) can be written as 


— f Xo ity X2 + 1x3 
=X) + 1x3 Xo — 1x1 


) xetxttxptxf=l. (2.5.12) 


Here the x; are real numbers. They can be interpreted as the coordinates of a 
unit quaternion x = Xp + ix; + jx2 + kx3 € Sp(1) (see Example 2.5.6). The reader 
may verify that the map Sp(1) > SU(2) : x & g in (2.5.12) is a Lie group isomor- 
phism. 


Exercise 2.5.22 (The double cover of SO(3)) Identify the imaginary part of H 
with R? and write a vector € € R* = Im(H) as a purely imaginary quaternion € = 
i€; + j& + k&3. Prove that if € € Im(H) and x € Sp(1), then x&x € Im(H). Define 
the map p : Sp(1) > SO(3) by p(x)é := xEx for x € Sp(1) and € € Im(H). Prove 
that the linear map p(x) : R* > R? is represented by the 3 x 3-matrix 


a + a7 — bs — xe 2(x1X2 — X0X3) 2(x1x3 + X9X2) 
p(x) = 2(x1X2 + X9X3) a + ed _ xe _ 7 2(x2x3 — X0X1) 
2(X1xX3 — XoX2) 2(xox3 +. X0x1) XO +.XZ— XT — XG 


Show that p is a Lie group homomorphism. Find a formula for the map 


p:= dp(1): sp(1) > s0(3) 
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and show that it is a Lie algebra isomorphism. For x,y € Sp(1) prove that 
p(x) = p(y) if and only if y = +x. 


Example 2.5.23 Let g be a finite-dimensional Lie algebra. Then the set 
@ is a bijective linear map and 


PE, n] = [E, On] (2.5.13) 
for all E,n € g 


Aut(g):= {P:qrog 


of Lie algebra automorphisms of g is a Lie group. Its Lie algebra is the space of 
derivations on g denoted by 


6 is a linear map and 
5 [E,n] = [6€,n] + [E, 6] ¢- (2.5.14) 
for all E,n € g 


Der(g) := 46:qg79g 


Now suppose that g = Lie(G) is the Lie algebra of a Lie group G. Then there is a 
map Ad: G > Aut(q) defined by 


Ad(g)n := gng (2.5.15) 


for g € Gand n € g. Part (ii) of Lemma 2.5.9 asserts that Ad(g) maps g to itself for 
every g € G. It follows directly from the definitions that the map Ad(g) : g > g is 
a Lie algebra automorphism for every g € G and that the map Ad: G > Aut(q) is a 
Lie group homomorphism. The associated Lie algebra homomorphism is the linear 
map ad : g — Der(q) defined by 


ad(&)n := [&, 7] (2.5.16) 


for €, € q. To verify the equation ad = Ad we compute 


. d 
Ad()n = — 


d 
q|_ dexpGOn = Z| expen exp(—t8) = [E, nl. 
=0 0 


, dt |\,_ 
Exercise 2.5.24 Let g be any Lie algebra. Define the map ad: g — End(q) 
by (2.5.16) and prove that the endomorphism ad(&) : g > g is a derivation for 


every € € g. Prove that ad : g > Der(q) is a Lie algebra homomorphism. 


Exercise 2.5.25 Let g be any finite-dimensional Lie algebra. Prove that the group 
Aut(q) in (2.5.13) is a Lie subgroup of GL(q) with the Lie algebra Lie(Aut(q)) = 
Der(q). Hint: Show that a linear map 6 : g > g is a derivation if and only if the 
linear map exp(t5) : g — g is a Lie algebra automorphism for every t € R. Use the 
Closed Subgroup Theorem 2.5.27. 
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2.5.4 Closed Subgroups 


This section deals with subgroups of a Lie group G that are also submani- 
folds of G. Such subgroups are called Lie subgroups. We assume throughout 
that G C GL(n, R) is a Lie group with the Lie algebra g := Lie(G) = 71G. 


Definition 2.5.26 (Lie subgroup) A subset H C Gis called a Lie subgroup of G 
iff it is both a subgroup and a smooth submanifold of G. 


A useful general criterion is the Closed Subgroup Theorem which asserts that a 
subgroup H C Gis a Lie subgroup if and only if it is a closed subset of G. This was 
first proved in 1929 by John von Neumann [54] for the special case G = GL(n, R) 
and then in 1930 by Elie Cartan [15] in full generality. 


Theorem 2.5.27 (Closed Subgroup Theorem) Let H be a subgroup of G. Then 
the following are equivalent. 


Gi) His a smooth submanifold (and hence a Lie subgroup) of G. 
(ii) His aclosed subset of G. 


If (i) holds, then the Lie algebra of H is the space 
 := {n € g| exp(tn) € H forall t € R}. (5.17) 


Proof of Theorem 2.5.27 (i) => (ii) and (2.5.17) Assume that H is a Lie subgroup 
of G and let h C g be defined by (2.5.17). We prove that is the Lie algebra of H. 
Assume first that 7 € 6. Then the curve y : R > G defined by y(t) := exp(tn) 
for t € R takes values in H and satisfies y(0) = 1 and y(0) = n, and this im- 
plies 7 € 7,;H = Lie(H). Conversely, if 7 € Lie(H), then Lemma 2.5.9 asserts 
that exp(tn) € H for all ¢ € R and hence 7 € §. This shows that h = Lie(H). 

Next we prove in three steps that H is a closed subset of G. Choose any inner 
product on g, denote by |-| the associated norm, and denote by h+ C g the orthog- 
onal complement of § with respect to this inner product. 


Step 1 There exist open neighborhoods V C H of land W C 6+ of the origin such 
that the map ¢: V x W — G, defined by 


pth, §) = hexp() 


forhéV and & € W, is a diffeomorphism from V x W onto an open neighbor- 
hood U = ¢(V x W) Cc Gof 1. 


The derivative of the map H x + > G: (h,&)  hexp(€) at the point (1,0) is 
bijective. Hence Step 1 follows from the Inverse Function Theorem. 
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Step 2 There exists a 8 > 0 such that, if && € 5+ satisfy |é|,|E| <8 and 
exp(&’) exp(—&) € H, then & = &. 


Let V,W,@ be as in Step 1, choose an open neigborhood V’ C G of 1 such 
that V’ MH = V, and choose a constant 6 > 0 such that the following holds. 


(a) If € € b+ satisfies |E| < 5, then & € W. 
(b) If €,&’ € g satisfy |&|, |&’| < 6, then exp(&’) exp(—&) € V’. 


Let &,& € + such that |&|, |&’| <6 and h := exp(€’) exp(—&) €¢ H. Then we 
have &,&' € W by (a) andh Ee V’'NH=V by (b). Also (A, €) = (1, &’) and 
so & = &’, because ¢ is injective on V x W. This proves Step 2. 


Step 3 Let h; be a sequence in H that converges to an element g € G. Then g € H. 


Let 6: V xW —-U be as in Step | and let 6 > 0 be as in Step 2. Since the 
sequence h-'g converges to 1, there exists an ig € N such that h>!g € U for 
alli > io. Hence, for each i > io, there exists a unique pair (h;, &) € V x W such 
that h>'g = h’ exp(&;). This sequence satisfies lim;_,.. &; = 0. Hence there exists 
an integer i; > ig such that |&;| < 6 for all i > i,. Since 


hh’, exp(&:) = g = hjh’, exp(§}), 


we also have exp(&;) exp(—&;) = (hjhi) hj hi, € H for all i, 7 > i,. By Step 3, 
this implies & = & for alli, 7 > i;. Hence & = limj_,.. &; = 0 for all i > i, and 
so g =h;h’, € H. This proves Step 3. 

By Step 3 the Lie subgroup H is a closed subset of G. Thus we have proved 
that (i) implies (ii) and (2.5.17) in Theorem 2.5.27. Oo 


The proof of the converse implication requires three preparatory lemmas. 


Lemma 2.5.28 Let & € g and let y: RG be a curve that is differentiable 
at t = Oand satisfies y(0) = 1 and y(0) = &. Then 


exp(té) = lim y(t /k)* (2.5.18) 
for everyt ER. 


Proof Fix a nonzero real number ¢ and define & := k(y(t/k) — 1) € R™ 
fork € N. Then 
eS oe) ae 
1 =? lim ————— =1y(0) =t 
po ee Be yO) = 
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and hence 
= &\N k 
exp(t€) = lim{1+—) = lim y(t/k)*. 
k-+o0o0 k k-0o0 
(See [64, Satz 1.5.2].) This proves Lemma 2.5.28. Oo 


Lemma 2.5.29 Let H C G be a closed subgroup. Then the set b in (2.5.17) is a 
Lie subalgebra of g 


Proof Let &,n € 6 and define the curve y : R > H by 


y(t) := exp(t&) exp(tn) 


for t € R. This curve is smooth and satisfies y(0) = 1 and y(0) = € + n. Since H 
is closed, it follows from Lemma 2.5.28 that 


exp(t(& +7)) = lim y(t/k)* € H 


for all t € R and so € + 9 € § by definition. Thus h is a linear subspace of g. 
Now fix an element & € §. If A € H, then 


exp(sh'&h) = h7! exp(sé)h € H 
for all s € R and hence h7'€h € 6 by definition. Take h = exp(tn) with n € § 


to obtain exp(—tn)& exp(tn) € § for all t € R. Differentiating this curve at t = 0 
gives [&, ] € 6 and this proves Lemma 2.5.29. oO 


Lemma 2.5.30 Let H C G be a closed subgroup and let h C gq be the Lie subal- 
gebra in (2.5.17). Let € € g, let (&;);en be a sequence in q, and let (t;)jen be a 
sequence of positive real numbers such that 


exp(fi)€H, §& #0 


for alli € N and 
lim 7 = 0, lim & = 0, lim — = &. 
Then & € §. 


Proof Fix a real number ¢. Then, for each i € N, there exists a unique inte- 
germ; € Z such that m;t; < t < (m; + 1)t;. The sequence m; satisfies 


lim m;t; = ft, lim m;& = lim ie = té& 
i—oo t; 


i—0oo i>oo i 
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and hence 
exp(t&) = lim exp(m;&;) = lim exp(&)"" € H. 


Thus exp(t&) € H for every ¢ € R and soé € h by (2.5.17). This proves Lemma 2.5.30. 
Oo 


Proof of Theorem 2.5.27 (ii) => (i) Choose any inner product on g. Let H C G 
be a closed subgroup of G and define the set § C g by (2.5.17). Then § is a Lie 
subalgebra of g by Lemma 2.5.29. Define 

k := dim(6), £:= dim(q) => k, 


and choose a basis 7,..., 7¢ of g such that the vectors 7)...., 7% form a basis of 
and 7, € + for v > k. Let ho € H and define the map © : R‘ > G by 


O(t!,...,t) := ho exp(t'm +--+ + tng) expt neg ++++ + t/N0). 
Then ©(0) = ho, O(R* x {0) C H, and the derivative d@(0) : R' > ThoG is 
bijective. Hence the inverse function theorem asserts that © restricts to a diffeo- 
morphism from an open neighborhood 2 C R° of the origin to an open neighbor- 
hood U := @(&2) C G of ho that satisfies 

@(0) = fo, @(2 Nn (RF x {0}) CUNH. 
We prove the following. 
Claim There exists an open set 2) C R° such that 


0EMCR, @(29 0 (R* x {0})) = UNH, Up := O(M2). (2.5.19) 


Assume, by contradiction, that such an open set (29 does not exist. Then there exists 
a sequence ft; = a ae t!) € R® such that 


lim t; = 0, t; € 2 \ (R* x {03), O(t;) € H. 
Define 


k as 
h; ‘= ho 00 in| eH, §& := D> t?m, € b+ \ {0}. 
v=1 


v=k+1 


Then h; exp(&;) = O(t;) € H and hence 


lim § = 0, & # 0, exp(&) = h;'@(t;) € H. 
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Passing to a subsequence, if necessary, we may assume that the sequence &;/|&;| 
converges. Denote its limit by & := limj—o0 & /|&;|. Then € € § by Lemma 2.5.30 
and € € h+ by definition. Since |&| = 1, this is a contradiction. This contradiction 
proves the Claim. Thus there does, after all, exist an open set 29 C R° that satis- 
fies (2.5.19), and the map @~! : Up —> Mp is then a coordinate chart on G which 
satisfies @~!(Uy NH) = 29 M (R* x {0}). Hence H is a submanifold of G and this 
proves Theorem 2.5.27. Oo 


Exercise 2.5.31 The subgroup {exp(it) |¢ € Q} C S! is not closed. 


Exercise 2.5.32 Choose a nonzero vector (@),...,@,) € R” such that at least one 
of the ratios @; /@; is irrational. Prove that the subgroup 


ie = f(g gees a DY i | t E R} Cc (sy ~T" 


of the torus is not closed. Similar examples exist in any Lie group that contains a 
torus of dimension at least two. 


Exercise 2.5.33 Let Go and G; be Lie subgroups of GL(”, R) with the Lie alge- 
bras go := Lie(Go) and gq; := Lie(G,). Prove that G := Gp N G, is a Lie subgroup 
of GL(n, R) with the Lie algebra g = qo N qj. 


Exercise 2.5.34 (Center) The center of a group G is the subgroup 
Z(G) := {g € G| gh = hg forall h € G}. (2.5.20) 


Let G C GL(n, R) be a Lie group. Prove that its center Z(G) is a Lie subgroup 
of G. If G is connected, prove that the Lie algebra of the center Z(G) is the center 
of the Lie algebra g = Lie(G), defined by 


Z(g) := {€ € g| [En] = 0 for all n € g}. (2.5.21) 


Hint: If G is connected, prove that an element & € g satisfies [&, 1] = 0 for all n € g 
if and only if exp(t®)h = hexp(té) for allt € R andallh €G. 


Exercise 2.5.35 Let GC GL(n, R) be a compact Lie group with the Lie alge- 
bra g := Lie(G) and let € € q. Prove that the set T; := {exp(té) | te R} is aclosed, 
connected, abelian subgroup of G and deduce that it is a Lie subgroup of G (called 
the torus generated by &). 


Exercise 2.5.36 Let GC GL(n,R) be a Lie group with the Lie algebra g and 
let € : R — g be a smooth function. Prove that the differential equation p(t) = 
E(t)y(t), y(O) = 1, has a unique solution y : R — G. Hint: Prove the existence of 
a solution y : R — GL(u, R) and show that the set {t € R| y(t) € G} is open and 
closed. 
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Remark 2.5.37 (Malcev’s Theorem) Let G C GL(n, R) be a Lie group with the 
Lie algebra g and let h C g be a Lie subalgebra. Then the set 


h: [0, 1] + Gis a smooth path 
H := 4 h(1)| such that h(0) = 1 and (2.5.22) 
h(t)h(t)~! € § for all t € [0, 1] 


is a subgroup of G, called the integral subgroup of 5. A theorem by Anatolij 
Ivanovich Malcev [47] (see also [28, Corollary 13.4.6]) asserts that H is a Lie sub- 


group of G if and only if 7, := {exp(t7) | te R} C Hfor ally € §. 
2.5.5 Lie Groups and Diffeomorphisms 
There is a natural correspondence between Lie groups and Lie algebras on the one 


hand and diffeomorphisms and vector fields on the other hand. We summarize this 
correspondence in the following table. 


Lie groups Diffeomorphisms 
G Cc GL(v, R) Diff(M) 
g = Lie(G) = 7;G Vect(M) = TiaDiff(M) 
exponential map flow of a vector field 
t b> exp(té) th d' = “exp(tx)’ 
adjoint representation pushforward 
Ets gig! Xt o.X 
Lie bracket on g Lie bracket of vector fields 
[En] = &n—né [X,Y] =dx-Y—-—dY-X 


To understand the correspondence between the exponential map and the flow of 
a vector field compare equation (2.4.6) with equation (2.5.5). To understand the 
correspondence between the adjoint representation and pushforward observe that 


d 
b.Y = — 


dt dt 


= 2 d Ee 
gow'og',  gng'=—| gexp(tn)g', 
=0 =0 


t t= 


where w‘ denotes the flow of Y. To understand the correspondence between the Lie 
brackets recall that 


iX.y)= 4 


A exp(t&) exp(—té), 


t=0 


i _d 
i? )«Y, [ee n] — dt 


t= 


where ¢' denotes the flow of X. We emphasize that the analogy between Lie groups 
and Diffeomorphisms only works well when the manifold M is compact so that ev- 
ery vector field on M is complete. The next exercise gives another parallel between 
the Lie bracket on the Lie algebra of a Lie group and the Lie bracket of two vector 
fields. 
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Exercise 2.5.38 Let G C GL(n,R) be a Lie group with Lie algebra g and let 
—,n € g. Define the smooth curve y : R > G by 


y(t) := exp(t§) exp(tn) exp(—t§) exp(—tn). 
Show that (0) = 0 and 5¥(0) = [&, n] (cf. Exercise 2.5.8 and Lemma 2.4.18). 


Exercise 2.5.39 Let G C GL(n,R) be a Lie group with Lie algebra g and let 
—,n € g. Show that [€, 7] = 0 if and only if the exponential maps commute, i.e. 
exp(s&) exp(tn) = exp(tn) exp(s&) = exp(s& + tn) for all s,¢ € R. How can this 
observation be deduced from Lemma 2.4.26? 


Definition 2.5.40 Let M Cc R* be a smooth manifold and let G C GL(n, R) be a 
Lie group. A (smooth) group action of G on M is a smooth map 


Gx M > M <(g, pp) $¢(p) (2.5.23) 

that for each pair g,h € G satisfies the condition 
be 2 bh = hgh, gy = id. (2.5.24) 
If (2.5.23) is a smooth group action, then the infinitesimal action of the Lie alge- 


bra g := Lie(G) on M is the map g — Vect(M) : & +> X; defined by 


d 
X¢(p) := ade 
t 


exp(ye)(P) (2.5.25) 
=0 
for§ € qgand pe M. 


Exercise 2.5.41 Let (2.5.23) be a smooth group action of a Lie group G on a 
manifold M. Prove that 


Xeleg = Oe Xe, Xen) = [Xe, Xn] (2.5.26) 
for all g € Gand all €, € g = Lie(G). 
Exercise 2.5.42 Show that the maps GL(m,R) x R” > R”: (gx) b gx, 


SO(m + 1) x S™ > 8S": (g,x) gx, and Rx S!'>S!:(6,z)b ez are 
smooth group actions. Verify the formulas in (2.5.26) in these examples. 


A smooth group action of a Lie group G on a manifold M can the thought of as 
a “Lie group homomorphism” 


G > Diff(M) : g& dg. (2.5.27) 


While the group Diff(/) is infinite-dimensional, and so cannot cannot be a Lie 
group in the formal sense, it has many properties in common with Lie groups as 
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explained above. For example, one can define what is meant by a smooth path 
in Diff(M) and extend formally the notion of a tangent vector (as the derivative 
of a path through a given element of Diff(/)) to this setting. In particular, the 
tangent space of Diff(/) at the identity can then be identified with the space of 
vector fields T,aDiff(/) = Vect(M), and the infinitesimal action in (2.5.25) is the 
Lie algebra homomorphism associated to the “Lie group homomorphism” (2.5.27). 
In fact, we have chosen the sign in the definition of the Lie bracket of vector fields 
so that the map +> X¢ is a Lie algebra homomorphism and not a Lie algebra 
anti-homomorphism. This will be discussed further in Sect. 2.5.7. 


2.5.6 Smooth Maps and Algebra Homomorphisms 


Let M be a smooth submanifold of R*. Denote by ¥(M) := C™%(M, R) the space 
of smooth real valued functions f : M — R. Then ¥(M) is a commutative unital 
algebra. Each p € M determines a unital algebra homomorphism ¢, : F(M) > R 
defined by e,(f) = f(p) for p< M. 


Theorem 2.5.43 Every unital algebra homomorphism ¢ : ¥(M) — R has the form 
€ = €, for some pe M. 


Proof Assume that ¢ : ¥(M) — R is an algebra homomorphism. 
Claim For all f, g € F(M) we have e(g) = 0 => &(f) € f(g7!(0)). 


Indeed, the function f — e(f)- 1 lies in the kernel of ¢ and so the function A := 
(f —e(f)- 1)? + g? also lies in the kernel of ¢. There must be at least one point 
Dp €M where h(p) = 0 for otherwise 1 = e(A)e(1/h) = 0. For this point p we 
have f(p) = e(f) and g(p) = 0, hence p € g~!(0), and therefore e(f) = f(p) € 
FS (g~!(0)). This proves the claim. 

The theorem asserts that there exists a p € M such that every f © F(M) 
satisfies e(f) = f(p). Assume, by contradiction, that this is false. Then for 
every p € M there exists a function f ¢ ¥(M) such that f(p) 4 e(f). Re- 
place f by f —e(f) to obtain f(p) 4 0 = e(f). Now use the axiom of choice 
to obtain a family of functions f, € F(M), one for every p € M, such that 
So(p) #0 = e(fp) for all p € M. Then the set U,:= f, \(R \ {O}) is an M- 
open neighborhood of p for every p € M. Choose a sequence of compact sets 
K, C M such that K, C inty(K,+1) for all n and M = (, Kn. Then, for each 
n, there is a g, € F(M) (a finite sum of the form )°; ee such that e(g,) = 0 
and g,(q) > 0 for all g € K,. If M is compact, this is already a contradic- 
tion because a positive function cannot belong to the kernel of ¢. Otherwise, 
choose f € F(M) such that f(q) =n for all g ¢ M \ K,, and all n € N. Then 
e(f) € f(g,!(0)) C f(M \ Kn) C [n, 0) by the claim and so e(f) > n for all n. 
This is a contradiction and proves Theorem 2.5.43. Oo 
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Now let N be another smooth submanifold (say of R‘) and let C®(M, N) denote 
the space of smooth maps from M to N. A homomorphism from F(N) to F(M) 
is a (real) linear map ® : F¥(N) — F(M) that satisfies 


O(fg)=PO(f)P(g), PO)=1. 


Let Hom(F(N ), F(M)) denote the space of homomorphisms from F(N) to F¥(M). 
An automorphism of the algebra F(M) is a bijective homomorphism @ : FJ(M) > 
F(M). The automorphisms of F(M) form a group denoted by Aut(F(M)). 


Corollary 2.5.44 The pullback operation 
C™(M, N) > Hom(F(N), F(M)): d & o* 


is bijective. In particular, the map Diff(M) — Aut(F(M)) :¢ & @* is an anti- 
isomorphism of groups. 


Proof This is an exercise with hint. Let ® : F(N) > F(M) be a unital alge- 
bra homomorphism. By Theorem 2.5.43 there exists a map 6: M — N such 
that ¢, 0 ® = &g(p) for all p € M. Prove that f of : M — R is smooth for every 
smooth map f : N — R and deduce that ¢ is smooth. O 


Remark 2.5.45 The pullback operation is functorial, i.e. 
(yoo) =¢g* oy", idy, = idg(u). 


for dé ¢ C™(M, N) and w € C™*(N, P). Here id denotes the identity map of the 
space indicated in the subscript. Hence Corollary 2.5.44 may be summarized by 
saying that the category of smooth manifolds and smooth maps is anti-isomorphic 
to a subcategory of the category of commutative unital algebras and unital algebra 
homomorphisms. 


Exercise 2.5.46 If M is compact, then there is a slightly different way to prove 
Theorem 2.5.43. An ideal in F(M) is a linear subspace J C F(M) satisfying 
the condition fe F(M), ged = > fg ed. A maximal ideal in F(M) is 
an ideal J © F(M) such that every ideal J’ ¢ F(M) containing J is equal to J. 
Prove that, if M is compact and g C F(M) is an ideal with the property that for 
every p € M there is an f €J with f(p) #0, then J = F(M). Deduce that 
each maximal ideal in F(M) has the form J, := {f ¢ F(M) | f(p) = 0} for 
some pe M. 


Exercise 2.5.47 If M is compact, give another proof of Corollary 2.5.44 as 
follows. The set ®~'(J,) is a maximal ideal in F(N) for each p € M. Use 
Exercise 2.5.46 to deduce that there is a unique map ¢: M — WN such that 
®'(J,) = J¢cp) for all p € M. Show that ¢ is smooth and ¢* = @. 
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Exercise 2.5.48 It is a theorem of ring theory that, when J C R is an ideal in a ring 
R, the quotient ring R/TJ is a field if and only if the ideal J is maximal. Show that the 
kernel of the ring homomorphism ¢, : J(M) — R of Theorem 2.5.43 is the ideal J, 
of Exercise 2.5.46. Conclude that M is compact if and only if every maximal ideal J 
in F¥(M) is of the form J = J, for some p € M. Hint: The functions of compact 
support form an ideal. It can be shown that if M is not compact and J is a maximal 
ideal containing all functions of compact support, then the quotient field ¥(M)/J 
is anon-Archimedean ordered field which properly contains R. 


2.5.7 Vector Fields and Derivations 
A derivation of F(M) is a linear map 6 : FJ(M) + F(M) that satisfies 


5(fg) = (fg + f4(g). 


and the derivations form a Lie algebra denoted by Der(F(/)). We may think of 
Der(F(M)) as the Lie algebra of Aut(F(M)) with the Lie bracket given by the 
commutator. By Theorem 2.5.43 the pullback operation 


Diff(M) > Aut(F(M)): 6 ¢* (25.28) 


can be thought of as a Lie group anti-isomorphism. Differentiating it at the identity 
¢ = id gives a linear map 


Vect(M) > Der(F(M)): X b Ly. (2.5.29) 


Here the operator Ly : F¥(M) > F(M) is given by the derivative of a function f 
in the direction of the vector field _X, i.e. 


d 
Lyf :=df-X = — fo¢', 
dt |,<0 
where ¢’ denotes the flow of X. Since the map (2.5.29) is the derivative of the 
“Lie group” anti-homomorphism (2.5.28) we expect it to be a Lie algebra anti- 
homomorphism. Indeed, one can show that 


Lixy) = Ly Ly — Ly Ly = —[Ly, Ly] (2.5.30) 


for X, Y € Vect(M). This confirms that our sign in the definition of the Lie bracket 
in Sect. 2.4.4 is consistent with the standard conventions in the theory of Lie groups. 
In the literature the difference between a vector field and the associated derivation 
L£y is sometimes neglected in the notation and many authors write Xf := df -X = 
Ly f, thus thinking of a vector field on a manifold M as an operator on the space of 
functions. With this notation one obtains the equation [X, Y] f = Y(Xf) — X(Vf) 
and here lies the origin for the use of the opposite sign for the Lie bracket in many 
books on differential geometry. 
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Exercise 2.5.49 Prove that the map (2.5.29) is bijective. Hint: Fix a derivation 
6 € Der(F(M)) and prove the following. Fact 1: If U C M is an open set and 
ff € F(M) vanishes on U, then 6(f) vanishes on U. Fact 2: If p € M and the 
derivative df(p) : T, M — R is zero, then (5(f))(p) = 0. (By Fact 1, the proof of 
Fact 2 can be reduced to an argument in local coordinates.) 


Exercise 2.5.50 Verify the formula (2.5.30). 


2.6 Vector Bundles and Submersions 


This section characterizes submersions (Sect. 2.6.1) and introduces the concept of 
a smooth vector bundle in the extrinsic setting (Sect. 2.6.2). 


2.6.1 Submersions 


Let M Cc R* be a smooth m-manifold and N C R® be a smooth n-manifold. A 
smooth map f : N —> M is called a submersion iff its derivative 


df(q) : TyN — Tyq)M 


is surjective for every g € N. 


Lemma 2.6.1 Let M Cc R* be a smooth m-manifold, N C R¢ be a smooth n- 
manifold, and f : N — M beasmooth map. The following are equivalent. 


(i) (ff is a submersion. 

(ii) For every qo € N there is an M-open neighborhood U of po := f(qo) and a 
smooth map g: U — N such that g(f(qdo)) = qo and fog =id:U > U. 
Thus f has a local right inverse near every point in N (see Fig. 2.12). 


Proof We prove that (i) implies (ii). Since the derivative 
df (Go) : TaN — Ty,M 
is surjective we have n > m and 


dim ker df(qo) =n —m. 


Fig. 2.12 A local right in- N q 
verse of a submersion (" naw [ 
el 


—— 


M U Po 
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Hence there is a linear map A: R‘ + R”~” whose restriction to the kernel of df (qo) 
is bijective. Now define the map y : N > M x R”~" by 


¥ (4) = (F(q), A@ — 40) 

forg € N. Then W(qo) = (Po, 0) and its derivative 
dW (Go) : TaN > TpM x R' 

sends w € T,,N to (df(qo)w, Aw) and is therefore bijective. Hence it follows from 
the inverse function theorem for manifolds (Theorem 2.2.17) that there exists an 
N -open neighborhood V C N of go such that the set 

W:=W(N)CMxR'™™ 
is an open neighborhood of (po, 0) and w|y : V > W is a diffeomorphism. Let 

U :={peEM|(p,0)€ W} 
and define the map g : U > N by 

g(p) := | (p, 0). 
Then po € U, g is smooth and 
(p.0) = W(g(p)) = (F(g(P)), A(8(P) — 40))- 

Hence f(g(p)) = p forall p € U and 

(Po) = W'(p0,0) = go. 


This shows that (i) implies (ii). The converse is an easy consequence of the chain 
rule and is left to the reader. This proves Lemma 2.6.1 O 


Corollary 2.6.2 The image of a submersion f : N — M is open. 


Proof If po = f(go) € f(N), then the neighborhood U C M of po in 
Lemma 2.6.1 (ii) is contained in the image of f. Oo 


Corollary 2.6.3. If N is anonempty compact manifold, M is a connected manifold, 
and f : N > M isa submersion, then f is surjective and M is compact. 


Proof The image f(N) is an open subset of M by Corollary 2.6.2, it is a rela- 
tively closed subset of M because N is compact, and it is nonempty because N is 
nonempty. Since M is connected this implies that f(V.) = M. In particular, M is 
compact. Oo 
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Exercise 2.6.4 Let f : N — M be a smooth map. Prove that the sets 
{q € N | df(q) is injective} and {q € N | df(q) is surjective} are open (in the 
relative topology of NV). 


2.6.2 Vector Bundles 
Let M C R* be an m-dimensional smooth manifold. 


Definition 2.6.5 A (smooth) vector bundle (over M of rank 7) is a smooth 
submanifold E C M x R° of dimension m + n such that, for every point p € M, 
the set 


E, := {v € R‘|(p,v) € E} 


is an n-dimensional linear subspace of R° (called the fiber of E over p). A vector 
bundle E over M is equipped with a smooth map 


zt:E—>M 


defined by z(p,v) := p. This map is called the canonical projection of E. 
If EC M x R° is a vector bundle, then a (smooth) section of E is a smooth 
map s : M — R° such that s(p) € E, for every p €¢ M. 


A section s : M — R° of a vector bundle E over M determines a smooth map 
o : M — E which sends the point p € M to the pair (p,s(p)) € E. This map 
satisfies 2 o o = id. It is sometimes convenient to abuse notation and eliminate the 
distinction between s and o. Thus we will sometimes use the same letter s for the 
map from M to R° and the map from M to E. 


Definition 2.6.6 Let M Cc R* be a smooth m-manifold. The set 
TM := {(p. v) | p eM,ve T,M \ 
is called the tangent bundle of V/. 
The tangent bundle is a subset of M x R* and, for every p € M, its fiber T,M 
is an m-dimensional linear subspace of R‘ by Theorem 2.2.3. However, it is not 
immediately obvious from the definition that TM is a submanifold of M x R*. 


This will be proved below. The sections of TM are the vector fields on M. 


Exercise 2.6.7 Let f : M — N be asmooth map between manifolds. Prove that 
the tangent map TM > TN : (p,v) & (f(p), df(p)v) is smooth. 
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Exercise 2.6.8 Let M Cc R* be a smooth m-manifold and let 6: U > Q 
be a smooth coordinate chart on an M-open set U C M with values in an 
open set 22 = @(U) C R”. Prove that the map @: TU > 2 x R” defined 
by (p.v) := (¢(p),do(p)) for p € U and v € T,M is a diffeomorphism. It 
is called a standard coordinate chart on 7M. Deduce that TM is a smooth 2m- 
dimensional submanifold of M x R* and hence is a smooth vector bundle over M. 
(See also Corollary 2.6.12 below.) 


Exercise 2.6.9 Let V C R° be ann-dimensional linear subspace. The orthogonal 
projection of R‘ onto V is the matrix JT ¢ R™° that satisfies 


=? =11', im IT = V. (2.6.1) 


Prove that there is a unique matrix J7 ¢ R™ satisfying (2.6.1). Prove that, for every 
symmetric matrix S = S' ¢ R*, the kernel of S is the orthogonal complement 
of the image of S. If D ¢ R™” is any injective matrix whose image is V, prove 
that det(D'D) 4 0 and 


TW = D(D'D)'D". (2.6.2) 


Theorem 2.6.10 (Vector bundles) Let M Cc R* be a smooth m-manifold and 
let EC M x R° be a subset. Assume that, for every p € M, the set 


E, := {v ER‘ |(p,v) € E} (2.6.3) 


is an n-dimensional linear subspace of R°. Let IT: M > R™® be the map that 
assigns to each p € M the orthogonal projection of R‘ onto Ep, i.e. 


II(p) = II(py = 1I(p)', im IT(p) = Ep. (2.6.4) 
Then the following are equivalent. 


(i) E is a vector bundle. 

(ii) For every po © M andevery vo € E,, there isa smoothmap s : M —> R¢ such 
that s(po) = vo and s(p) € E, forall p € M. 

(iii) The map IT : M — R®® is smooth. 

(iv) For every po € M there is an open neighborhood U C M of po and a diffeo- 
morphism 1~'(U) + U x R": (p,v) & ®(p, v) = (p, ®,(v)) such that the 
map ®, : E, — IR” is an isometric isomorphism for all p € U. 

(v) For every po € M there is an open neighborhood U C M of po and a diffeo- 
morphism 1~'(U) + U x R": (p,v) & ®(p, v) = (p, ®,(v)) such that the 
map ®, : E, — R"” is a vector space isomorphism for all p € U. 


Condition (i) implies that the projection m : E — M is a submersion. In (ii) the 
section Ss can be chosen to have compact support, i.e. there exists a compact subset 
K CM such that s(p) = 0 forall p € M \ K. 


Before giving the proof of Theorem 2.6.10 we explain some of its consequences. 
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Definition 2.6.11 The maps @ : x~'!(U) > U x R" in Theorem 2.6.10 are called 
local trivializations of E. They fit into commutative diagrams 


x!(U) UxR". 


Corollary 2.6.12 Let M C R* be a smooth m-manifold. Then TM is a vector 
bundle over M and hence is a smooth 2m-manifold in R* x R*. 


Proof Let ¢: U — & be a coordinate chart on an M-open set U C M with 
values in an open subset 2 C R”. Denote its inverse by y := ¢ | : Q > M. By 
Theorem 2.2.3 the linear map d(x): IR” — R* is injective and its image is Ty()M 
for every x € @. Hence the map D : U > R**” defined by 


D(p) := dW(o(p)) € R™ 


is smooth and, for every p € U, the linear map D(p) : R” — R* is injective and 
its image is T,M. Thus the function 177™ : M > R'* defined by (2.6.4) with 
E, = T,M is given by 
-1 
1™ (p) = D(p)(D(p)'D(p)) Dp)" for p € U. 


Hence J77™ is smooth and so TM is a vector bundle by Theorem 2.6.10. O 


Let M c Ré be an m-manifold, N C R° be an n-manifold, f:N—>M bea 
smooth map, and E C M x R¢ be a vector bundle. The pullback bundle is the 
vector bundle f* E — N defined by 


fe E:={(qv)eNxR@|veE Era} 
and the normal bundle of E is the vector bundle E+ — M defined by 
E+ := {(p,w) € M x R*|(v,w) =0V ve BE}. 
Corollary 2.6.13 The pullback and normal bundles are vector bundles. 


Proof Let IT = I” : M -> R4¢“ be the map defined by (2.6.4). This map is 
smooth by Theorem 2.6.10. Moreover, the corresponding maps for f*E and E+ 
are given by 


We’ = Teo f: NORM, We =1-T72: MSR, 


These maps are smooth and hence it follows from Theorem 2.6.10 that f* E and E+ 
are vector bundles. O 
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Proof of Theorem 2.6.10 We first assume that E is a vector bundle and prove that 
wt: E—M isasubmersion. Leto: M — E denote the zero section given by o(p):= 
(p,0). Then z oo = id and hence it follows from the chain rule that the derivative 
dx(p,0): Tp) E — T,M is surjective. Now it follows from Exercise 2.6.4 that for 
every p € M there is an e > 0 such that the derivative da(p, v) : Tip» E > T,M 
is surjective for every v € E, with |v| < e. Consider the map f, : E — E defined 
by 


Si(p,v) == (p, Av). 
This map is a diffeomorphism for every A > 0. It satisfies 
wT=10 fy 
and hence 
dx(p,v) = da(p,dv) 0 dfi(p. v) : Tip») E > TpM. 
Since df,(p, v) is bijective and dx(p, Av) is surjective for A < e/|v| it follows 
that d(p, v) is surjective for every p € M and every v € E,. Thus the projection 
mt: E — M isasubmersion for every vector bundle E over M. 

We prove that (i) implies (ii). Let pp ¢ M and vo € E,,. We have already proved 
that z is a submersion. Hence it follows from Lemma 2.6.1 that there exists an 
M-open neighborhood U C M of po and a smooth map 

o:U-E 
such that 
009 = id: U—-U, 00(Po) = (Po; Vo): 
Define the map sy : U > R¢ by 
(p.80(P)) = 90(p) for p € U. 
Then so(po) = vo and so(p) € £, for all p € U. Now choose ¢ > 0 such that 
{(p EM ||p— pol <e} CU 


and choose a smooth cutoff function 6 : R« — [0, 1] such that B( po) = 1 and B(p) =0 
for |p — po| = &. Define s : M — R¢ by 


B(p)so(p), if p € U, 
0, 


= if p ¢U. 


This map satisfies the requirements of (ii). 
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We prove that (ii) implies (iii). Thus we assume that F satisfies (11). Choose 
Po€M anda basis v,..., v, of E,,. By (ii) there exists smooth sections $1,..., Sy: 
M > R° of E such that s; (Po) = v; fori = 1,...,n. Now there exists an M-open 
neighborhood U Cc M of po such that the vectors s1(p),..., S,(p) are linearly 
independent, and hence form a basis of E,, for every p € U. Hence, for every p €U, 
we have 


E,=imD(p), — D(p) := [s1(p)--- Sn (p)] E R™. 


By Exercise 2.6.9, this implies [7(p) = D(p)(D(p)'D(p))"!D(p)' for ev- 
ery p € U. Thus every po € M has a neighborhood U such that the restriction 
of IT to U is smooth. This shows that (11) implies (iii). 

We prove that (iii) implies (iv). Fix a point pp € M and choose a basis v1,..., Vy, 
of E,,. For p € M define 


D(p) := [1(p)u1 ++ FT(p)vn] € R™ 
Then D : M -> R™" is asmooth map and D(po) has rank n. Hence the set 
U :={péM|rankD(p) =n} CM 
is an open neighborhood of pp and E, = imD(p) for all p ¢ U. Thus 
x '(U)={(p,v)€ E|pEeuyceE 
is an open set containing 2~!(po). Define the map ® : 1~'(U) > U x R" by 
P(p.v) = (p.Pp(v)), — Pp(v) = (D(P)"D(D))" D(p)"v 


for p € U and v € E,. This map is bijective and its inverse is given by 


&"(p,£) =(p.6/(6)), — @(E) = D(p)(D(p)" Dp) 


for p€U and & ER". Thus @ is a diffeomorphism and |, (v)| = |v| for 
all p € U andall v € E,. This shows that (iii) implies (iv). 

That (iv) implies (v) is obvious. 

We prove that (v) implies (1). Shrinking U if necessary, we may assume that there 
exists a coordinate chart ¢ : U — {2 with values in an open set §2 C R”. Then 
the composition (# x id) o ® : m~'!(U) > 2 x R" is a diffeomorphism. Thus E C 
R* x R¢ is a manifold of dimension m + n and this proves Theorem 2.6.10. Oo 


Exercise 2.6.14 Define the notion of an isomorphism between two vector bundles 
E and F over M. Construct a vector bundle E Cc S! x R? of rank 1 that does not 
admit a global trivialization, i.e. that is not isomorphic to the trivial bundle S! x R. 
Such a vector bundle is called a MO6bius strip. 
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2.6.3 Thelmplicit Function Theorem 


Next we carry over the Implicit Function Theorem in Corollary A.2.6 to smooth 
maps on vector bundles. 


Theorem 2.6.15 (Implicit Function Theorem) Let M Cc R* be a smooth m- 
manifold, let N Cc R* be a smooth n-manifold, let E C M x R° be a smooth vector 
bundle of rank n, let W C E be open, and let f :W — N be a smooth map. 
For p € M define f, : W, > N by 


W, := {v € E,|(p,v) € W}, fp) = fp, »): 


Let po € M such that 0 € W,, and dfp.(0) : Ep, — TaN is bijective, where 
qo := f(po,9) € N. Then there exists a constant ¢ > 0, open neighborhoods 
Uo C M of po and Vo C N of qo, and a smooth map h : Uy x Vo > R¢ such that 
{(p,v) € E| p € Uo, |v| < e} C W and 


h(p.q) € Ep, |h(p,q)| <€ (2.6.5) 
for all (p,q) € Up x Vo and 
Sp(v) = 0 — v =h(p,q) (2.6.6) 
for all (p,q) € Up X Vo, and all v € E, with |v| < «. 
Proof Choose a coordinate chart w:V — R” on an open set V C N con- 
taining qo. Choose an open neighborhood U C M of po such that (p,0) « W 
and f(p,0) € V for all p € U, there is a coordinate chart 6: U > 22 C R”, 
and there is a local trivialization ® : x~'(U) — U x R" as in Theorem 2.6.10 
with |,(v)| = |v| for p ¢ U and v € E). Define B, := {& € R” | |E| < r} and 


choose r > 0 so small that ®-!(U x B,) C W and fo ®!(U x B,) C V. De- 
fine the map F : 2 x R" x B, > R" by 


F(x, y,6) = wo fo®'(p '(x),€)—y 


for (x,y) € 2x R" and — € B,. Let xo := (po) and yo := W(qo). Then we 
have F(xo, yo, 0) = O and the derivative d3 F' (x0, yo,0) : R” > R” of F with re- 
spect to & at (xo, yo, 0) is bijective. Hence Corollary A.2.6 asserts that there exist 
open neighborhoods Up C U of po and Vo C V of go, a constant 0 < ¢ <r,anda 
smooth map g : 6(Up) x W(VYo) > B, such that 


F(x, y,§) =0 = g(x,y) HE 
for all (x, y) € (Uo) x w(Vo) and all € € B,. Thus the map 


h:UyxVo>R‘, — h(p.q) = © '(g(6(p). ¥(Q))). 


satisfies the requirements of Theorem 2.6.15. Oo 


2.7. The Theorem of Frobenius 77 
2.7. The Theorem of Frobenius 


Let M C R* be an m-dimensional manifold and n be a nonnegative integer. A sub- 
bundle of rank n of the tangent bundle TM is a subset E C TM that is itself 
a vector bundle of rank n over M, i.e. it is a submanifold of 7M and the fiber 
E, = {v €T,M |(p,v) € E\ is an n-dimensional linear subspace of T,,M for ev- 
ery p € M. Note that the rank n of a subbundle is necessarily less than or equal to 
m. In the literature a subbundle of the tangent bundle is sometimes called a distribu- 
tion on M. We shall, however, not use this terminology in order to avoid confusion 
with the concept of a distribution in the functional analytic setting. 


Definition 2.7.1. Let M Cc Ré* be an m-dimensional manifold and E C TM bea 
subbundle of rank n. The subbundle F is called involutive if, for any two vector 
fields X,Y € Vect(M), we have 


X(p). Y(p)€ Ey V pe M — [X,Y\(p)€ E, Vp eM. (2.7.1) 


The subundle E is called integrable if, for every po € M, there exists a submani- 
fold N C M such that po € N and 7T,N = E, for every p € N. A foliation box for 
E (see Fig. 2.13) is a coordinate chart ¢ : U — §2 on an M-open subset U C M 
with values in an open set 22 C R” x R”” such that the set 2M (R” x {y}) is 
connected for every y ¢ R”™” and, for every p € U and every v € T,M, we have 


vekE, —> do(p)v € R” x {0}. 
Theorem 2.7.2 (Frobenius) Let Mc R* be anm-dimensional manifold and E Cc 
TM be asubbundle of rank n. Then the following are equivalent. 


(i) E is involutive. 
(ii) E is integrable. 
(iii) For every po € M there exists a foliation box @ : U > Q with po € U. 


It is easy to show that (iii) => (ii) => (i) (see below). The hard part of the 
theorem is to prove that (i) ==> (iii). This requires the following lemma. 


Lemma 2.7.3. Let E C TM be an involutive subbundle and X € Vect(M) be a 
complete vector field such that X(p) € E, for every p € M. Denote by 


R > Diff(M): tr ¢' 
the flow of X. Then, for all t € IR and all p € M, we have 
dg! (p)E, = Egy). (2.7.2) 


(eye 


Fig. 2.13 A foliation box 
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We show first how Theorem 2.7.2 follows from Lemma 2.7.3. 


Lemma 2.7.3 implies Theorem 2.7.2 We prove first that (iii) implies (ii). 
Let po € M, choose a foliation box ¢ : U > 92 for E with po € U, and define 


N := (p €U|¢(p) € R” x {yo$} 


where (x0, Yo) := @(po) € &. Then N satisfies the requirements of (ii). 

We prove that (ii) implies (i). Choose two vector fields X,Y € Vect(M) that 
satisfy X(p), Y(p) € E, for all p € M and fix a point po € M. Then, by (ii), there 
exists a submanifold N C M containing po such that 7,N = E, for every p € N. 
Hence the restrictions X |, and Y|y are vector fields on N and so is the restriction 
of the Lie bracket [X, Y] to N. Thus we have [X, Y](po) € T,,N = Ep, as claimed. 

We prove that (i) implies (iii). Thus we assume that F is an involutive sub- 
bundle of 7M and fix a point pp € M. By Theorem 2.6.10 there exist vector 
fields X,,..., X, € Vect(M) such that X;(p) € E> for all i and p and the vec- 
tors X;(po),..., Xn(po) form a basis of E,,. Using Theorem 2.6.10 again we find 
vector fields Yj,..., Yn—-n € Vect(M) such that the vectors 


X\(po)..--,Xn(Po), Y1(Po),--- + ¥Ym—n (Po) 


form a basis of 7,, M. Using cutoff functions as in the proof of Theorem 2.6.10 we 
may assume without loss of generality that the vector fields X; and Y; have compact 
support and hence are complete (see Exercise 2.4.13). Denote by ¢/,..., 7 the 
flows of the vector fields X;,..., X,, respectively, and by yj,..., Wj,_,, the flows 
of the vector fields Y,,..., Yin_n. Define the map 


w:R"’xR""—M 
by 
W(X, Y) = Pp! o+++ 0 Gj" 0 Wj! o-+- 0 Wn (Po): 
By Lemma 2.7.3, this map satisfies 


oy 
i € Eyx,y) (2.7.3) 


for all x € IR” and y € R™”. Moreover, 


dy 


Ox; 


0 
(0,0) = Xi(po). 50.0) = Y;(po). 
J 


and so the derivative 


dy(0,0):R" x R"" + T,,M 
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is bijective. Hence, by the Inverse Function Theorem 2.2.17, there exists an open 
neighborhood (2 C R” x R”~” of the origin such that the set 

U:=wW(2)CM 


is an M-open neighborhood of po and W|q : 2 — U is a diffeomorphism. Thus the 
vectors 0w/0x;(x, y) are linearly independent for every (x, y) € 92 and, by (2.7.3), 
form a basis of Ey (x,y). Hence 


¢:=(vley':U +2 


is a foliation box and this proves Theorem 2.7.2, assuming Lemma 2.7.3. 


To complete the proof of the Frobenius theorem it remains to prove Lemma 2.7.3. 
This requires the following result. 


Lemma 2.7.4 Let E C TM be an involutive subbundle. If B :R? > M is a smooth 
map such that 


a r) 
s.0) € Epis), aa t) € Egos), (2.7.4) 


for all s,t € R, then 
op 
—(s,t) € Easy), DAD 
foralls,t €R. 
We first show how Lemma 2.7.3 follows from Lemma 2.7.4. 


Lemma 2.7.4 implies Lemma 2.7.3 Let X € Vect(M) be a complete vector field 
satisfying X(p) € E, for every p € M and let ¢' be the flow of X. Choose a 
point po € M and a vector vp € E,,. By Theorem 2.6.10 there is a vector field 
Y € Vect(M) with values in E such that Y(po) = vo. Moreover this vector field may 
be chosen to have compact support and hence it is complete (see Exercise 2.4.13). 
Thus there is a solution y : R — M of the initial value problem 


ys) = Yv(s)), y(0) = po. 
Define 6 : R* > M by 
B(s,t) = $'(y(s)) 
for s,t € R. Then 


0, 
(8.0) = 768) = YS) € Epo, 


0 
(8.1) = X(B.1) € Eps 
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for all s,t € R. Hence it follows from Lemma 2.7.4 that 


0 
46" (pov = 46" (Y(O))V(0) = (0,1) € Ege 


for every ¢ € R. This proves Lemma 2.7.3, assuming Lemma 2.7.4. O 


Proof of Lemma 2.7.4 Given any point po € M we choose a coordinate chart 
o@:U — Q, defined on an M-open set U C M with values in an open set 922 C 
R”" xR”, such that po € U and d$(po) Ep, = R” x {0}. Shrinking U, if necessary, 
we find that for every p € U the linear subspace d(p)E, C R” x R”” is the 
graph of a matrix A € R“’-")*", Thus there exists a smooth map A : Q > R("™~")*" 
such that, for every p € U, 


do(p)Ep =4(E, AX VWEIEER"}, Coy =bM)e2. — (2.7.6) 
For (x, y) € @ define the linear maps 


dA n (m—n)xn dA m—n (m—n)xn 
—(x,y):R"-R ; —(x,y):R —>R 
ax oy 


by 
0A S. OA 0A 0A 
Be I) B= D Fig HY), By HITE Daeg 


for € = (&,...,&,) € R” and 7 = (m,...,%m—n) € R’” ". We prove the following. 


Claim 1 Let (x, y) € 92, &,& € R” and define n,n! € R"” by n := A(x, y)E and 
n! := A(x, y)&’. Then 


0A 0A 0A , OA 

a(x. y)-§ + 3—(x,y)-9 JE = | oy) + 5 (x.y) -f JE. 

Ox oy Ox oy 
The graphs of the matrices A(z) determine a subbundle E C 2x R" with the fibers 

E. = (9) €R" xR" | = A(w, )§} 
for z = (x, y) € 92. This subbundle is the image of the restriction 
Ely := {(p,v)|p €U, v € E,} 

under the diffeomorphism TM|y > Q x R” : (p,v) & (¢(p),d¢(p)v) and 
hence it is involutive. Now fix two elements &,&’ € IR” and define the vector 


fields ¢,¢': 822 > R” by 


G2) :=€,4@§), @:= EAE), ze. 
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Then ¢ and ¢’ are sections of E and their Lie bracket [¢, 6’] is given by 
5, oz) = (0, (dA(z)o'(2))E — AA(Z)E(Z))E’). 


Since E is involutive the Lie bracket [¢, ¢’] must take values in the graph of A. 
Hence the right hand side vanishes and this proves Claim 1. 


Claim 2 Let I, J C R be open intervals and let z = (x, y): I x J > 92 be a smooth 
map. Fix two points Sy € I and ty € J and assume that 


a Cd) 

5 (80s to) = A(x(so, to), y (So, f0)) — (so. to). (2.7.7) 
Ss Os 
oy _ Ox 
57 oD) = A(x(s,t), y(s,t)) 37 oD) (2.7.8) 


foralls € I andt € J. Then 


0 Ox 
2 (59,1) = A(x(s0,1), »(50,t)) = (50,1) (2.7.9) 
os Os 

forallt € J. 


Equation (2.7.9) holds by assumption for ¢ = fo. Moreover, dropping the argument 
Z(so,t) = Zz = (x, y) for notational convenience we obtain 


d (dy, ax Py 8x (dA dx | OA dy \ ax 
dt \ Os ds dsot - ox ot dy dt) ds 


a Ox 7. dA A ax \\ 0x 
ot oy ot ) } Os 
dy f ox 0A ax \\ dx 
dsdt ‘aa ac ds oy ( ds ) } ot 
dy a dx 0A dy\dx 

sot Aa ds oy Os 


dA (dO 
4 (2% a Ox 
dy \ods Os Ot 
_ (9A (day 4 dx \\ Ox 
~ Lay \as as} } ot’ 
Here the second step follows from (2.7.8), the third step follows from Claim 1, and 


the last step follows by differentiating equation (2.7.8) with respect to s. Define the 
curve 7: J > R”™™” by 


ay 
dsot ‘iar 


0 
n(t) = ™(s9,1) — A(e(s0.0), 9460.0) 5 (so, t). 
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By (2.7.7) and what we have just proved, the curve 77 satisfies the linear differential 
equation 


A 0 
nt) = (5 (60.0. 2660.9) : n()) 5 (S00), n(to) = 0. 


Hence y(t) = 0 for all t € J. This proves (2.7.9) and Claim 2. 

Now let 8 : R? — M be a smooth map satisfying (2.7.4) and fix a real num- 
ber so. Consider the set W := {t € R|0;B(so,t) € Episy2)}. By going to local co- 
ordinates, we obtain from Claim 2 that W is open. Moreover, W is obviously closed, 
and W 4 © because 0 € W by (2.7.4). Hence W = R. Since so € R was chosen 
arbitrarily, this proves (2.7.5) and Lemma 2.7.4. Oo 


Any subbundle E Cc TM determines an equivalence relation on M via 


there is a smooth curve y : [0,1] > M@ 


(2.7.10) 
such that y(O) = po, yO) = pi. y(t) © Ey Vt. 


Po~ Pi 


If E is integrable, this equivalence relation is called a foliation and the equivalence 
class of po € M is called the leaf of the foliation through po. The next example 
shows that the leaves do not need to be submanifolds. 
Example 2.7.5 Consider the torus M := S! x S' Cc C? with the tangent bundle 

TM = {(21, 22, 14121, iA2zZ2) €C4| IZ1| = [22] = 1, Ay, Ao € R}. 
Let w,, @2 be real numbers and consider the subbundle 

E:= {(Z1, 22, iw 21, itw2Z2) éc"| lz1| = |r| =1,te€ R}. 

The leaf of this subbundle through z = (z;, 22) € T? is given by 


L= {(e@z1, elt 29) E € RI. 


It is a submanifold if and only if the quotient @; /@» is a rational number (or @2 = 0). 
Otherwise each leaf is a dense subset of T?. 


Exercise 2.7.6 Prove that (2.7.10) defines an equivalence relation for every sub- 
bundle EF CTM. 


Exercise 2.7.7. Each subbundle E Cc TM of rank | is integrable. 
Exercise 2.7.8 Consider the manifold M = R?. Prove that the subbundle E C 


TM =R? xR? with fiber E, = {(€, n, 6) € R? | €—y& =0} over p = (x, y,z) € R? 
is not integrable and that any two points in R? can be joined by a path tangent to E. 
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Exercise 2.7.9 Consider the manifold M = S? Cc R* = C? and define 
B={GHeC’ xe ||2)=1, 0 Le, 3}, 
Thus the fiber 
EC LS’ =e 


is the maximal complex linear subspace of T,S*. Prove that E has real rank 2 and 
is not integrable. 


Exercise 2.7.10 Let E Cc TM bean involutive subbundle of rank n and let L C M 
be a leaf of the foliation determined by FE. A subset V C L is called L-open iff it 
can be written as a union of submanifolds N of M with tangent spaces 7, N = E, 
for p € N. Prove that the L-open sets form a topology on L (called the intrinsic 
topology). Prove that the obvious inclusion ¢ : L — M is continuous with respect 
to the intrinsic topology on L. Prove that the inclusion 1: L + M is proper if 
and only if the intrinsic topology on L agrees with the relative topology inherited 
from M (called the extrinsic topology). 


Remark 2.7.11 It is surprisingly difficult to prove that each closed leaf L of a 
foliation is a submanifold of M.A proof due to David Epstein [19] is sketched 
in Sect. 2.9.4 below. 


2.8 The Intrinsic Definition of a Manifold* 


It is somewhat restrictive to only consider manifolds that are embedded in some 
Euclidean space. Although we shall see that (at least) every compact manifold ad- 
mits an embedding into a Euclidean space, such an embedding is in many cases not 
a natural part of the structure of a manifold. In particular, we encounter manifolds 
that are described as quotient spaces and there are manifolds that are embedded in 
certain infinite-dimensional Hilbert spaces. For this reason it is convenient, at this 
point, to introduce a more general intrinisc definition of a manifold. (See Chap. | 
for an overview.) This requires some background from point set topology that is not 
covered in the first year analysis courses. We shall then see that all the definitions 
and results of this chapter carry over in a natural manner to the intrinsic setting. We 
begin by recalling the intrinsing definition of a smooth manifold in Sect. 1.4. 


2.8.1 Definition and Examples 


Definition 2.8.1 (Smooth m-manifold) Let m € No and M be a set. A chart 
on M is a pair (¢,U) where U C M and ¢ is a bijection from U to an open 
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Fig. 2.14 Coordinate charts 


and transition maps a Di 
M 


set @(U) C R”. Two charts (¢), U;), (¢2, U2) are called compatible iff ¢; (U; NU2) 
and $2(U; M U2) are open and the transition map 


dar = $2096)! (U1 N Ur) > b2(U, N U2) (2.8.1) 


is a diffeomorphism (see Fig. 2.14). A smooth atlas on M is a collection A of 
charts on M any two of which are compatible and such that the sets U, as (¢, U) 
ranges over A, cover M (i.e. for every p € M there exists a chart (¢, U) € A with 
p € U). A maximal smooth atlas is an atlas which contains every chart which is 
compatible with each of its members. A smooth m-manifold is a pair consisting of 
a set M and a maximal atlas A on M. 


In Lemma 1.4.3 it was shown that, if A is an atlas, then so is the collection A 
of all charts compatible with each member of A. Moreover, the atlas A is maximal, 
so every atlas extends uniquely to a maximal atlas. For this reason, a manifold is 
usually specified by giving its underlying set M and some atlas on M. Generally, 
the notation for the atlas is suppressed and the manifold is denoted simply by M. 
The members of the atlas are called coordinate charts or simply charts on M. By 
Lemma 1.3.3 a smooth m-manifold admits a unique topology such that, for each 
chart (¢, U) of the smooth atlas, the set U C M is open and the bijection 


o:U > WU) 


is a homeomorphism onto the open set ¢(U) C R”. This topology is called the 
intrinsic topology of M and is described in the following definition. 


Definition 2.8.2 Let M be a smooth m-manifold. The intrinsic topology on the 
set M is the topology induced by the charts, i.e. a subset 


WcM 


is open in the intrinsic topology iff ¢(U M W) is an open subset of IR” for every 
chart (#, U) on M |! 


' At this point we do not assume that the intrinsic topology on the manifold M is Hausdorff or 
second countable. These hypotheses will be imposed after the end of the present chapter. For 
explanations see the comments at the end of Sect. 2.8.1 and of Sect. 2.9.5. 


2.8 The Intrinsic Definition of a Manifold* 85 


Remark 2.8.3. Let M C R* be smooth m-dimensional submanifold of R* as in 
Definition 2.1.3. Then the set of all diffeomorphisms (¢,U M M) as in Defini- 
tion 2.1.3 form a smooth atlas as in Definition 2.8.1. The intrinsic topology on the 
resulting smooth manifold is the same as the relative topology defined in Sect. 1.3. 


Remark 2.8.4 A topological manifold is a topological space such that each point 
has a neighborhood U homeomorphic to an open subset of R”. Thus a smooth 
manifold (with the intrinsic topology) is a topological manifold and its maximal 
smooth atlas A is a subset of the set Ao of all pairs (6, U) where U C M is an open 
set and ¢ is a homeomorphism from U to an open subset of IR”. One says that the 
maximal smooth atlas A is a smooth structure on the topological manifold M iff 
the topology of M is the intrinsic topology of the smooth structure and every chart 
of the smooth structure is a homeomorphism. As explained in Sect. 1.4 a topological 
manifold can have many distinct smooth structures (see Remark 1.4.6). However, it 
is a deep theorem beyond the scope of this book that there are topological manifolds 
which do not admit any smooth structure. 


Example 2.8.5 The complex projective space CP” is the set 
CPs ite C"*! | Zis a 1-dimensional complex subspace} 
of complex lines in C”*!. It can be identified with the quotient space 
CP = (c+ \ (0))/C* 

of nonzero vectors in C"*! modulo the action of the multiplicative group C* = 
C \ {0} of nonzero complex numbers. The equivalence class of a nonzero vector 
z = (Zo,.--,Zn) € C”*! will be denoted by 

[z] = [Z0: 21 2 -0+ 2 Zn] = {Az|A € C*} 
and the associated line is = Cz. An atlas on CP” is given by the open cover U; := 


{[Zo 2 +++: Zn] |Z; 4 0} fori = 0,1,...,” and the coordinate charts ¢; : U; > C” 
are 


(2.8.2) 


pees 


Z0 Zi-1 Zi+1 Zn 
i Zi ; Zi Zi 


Pj ([Zo 2 +++: Zn]) = ( 


Exercise: Prove that each ¢; is a homeomorphism and the transition maps are 
holomorphic. Prove that the manifold topology is the quotient topology, i.e. if z : 
C”*! \ {0 + CP" denotes the obvious projection, then a subset U C CP" is open 
if and only if z~!(U) is an open subset of C”*! \ {0}. 


Example 2.8.6 The real projective space RP” is the set 


RP” = {¢ C R"t!| £is a 1-dimensional linear subspace} 
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of real lines in R"+!. It can again be identified with the quotient space 
RP” = (R"*! \ {0})/R* 
of nonzero vectors in R"*+! modulo the action of the multiplicative group R* = 
R \ {0} of nonzero real numbers, and the equivalence class of a nonzero vector 
x = (Xo,...,Xn) € R"*! will be denoted by 
[x] = [xo 2 Xp tee Xn] c= {Ax | A © R*}. 
An atlas on RP” is given by the open cover 
U; = {[X0 2 +++: Xn] |x: A 0} 
and the coordinate charts ¢; : U; — IR” are again given by (2.8.2), with z; replaced 
by x;. The arguments in Example 2.8.5 show that these coordinate charts form an 
atlas and the manifold topology is the quotient topology. The transition maps are 
real analytic diffeomorphisms. 
Example 2.8.7 The real 1-torus is the topological space 
T” = R"/Z" 
equipped with the quotient topology. Thus two vectors x, y € R” are equivalent iff 
their difference x — y € Z” is an integer vector and we denote by z : R” > T” the 
obvious projection which assigns to each vector x € R” its equivalence class 


u(x) := [x] :=x+Z". 


Then a set U C T” is open if and only if the set x~'(U) is an open subset of R". 
An atlas on T” is given by the open cover 


Uy := {[x] |x € R”, |x —a@] < 1/2}, 
parametrized by vectors a € R”, and the coordinate charts dy : Uy > R” defined by 
dbu([x]) := x for x € R” with |x — a| < 1/2. Exercise: Show that each transition 
map for this atlas is a translation by an integer vector. 
Example 2.8.8 Consider the complex Grassmannian 
G,(C") := {V C C" | v is a k-dimensional complex linear subspace}. 


This set can again be described as a quotient space G,(C”) = F,(C")/U(k). Here 


Fi (C") = {D € crxk | D*D= 1} 
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denotes the set of unitary k-frames in C” and the group U(k) acts on #,(C”) con- 
travariantly by D +> Dg for g € U(k). The projection 


m: Fe(C") > G(C") 


sends a matrix D € F¥,(C”) to its image V := 2(D):=im D. A subset U C G;(C”) 
is open if and only if z~!(U) is an open subset of ¥,(C”). Given a k-dimensional 
subspace V C C” we can define an open set Uy C G;,(C") as the set of all k- 
dimensional subspaces of C” that can be represented as graphs of linear maps 
from V to V+. This set of graphs can be identified with the complex vector space 
Hom (V, V+) of complex linear maps from V to Vt and hence with C“-)**, 
This leads to an atlas on G,;(C”) with holomorphic transition maps and shows that 
G;(C") is a manifold of complex dimension kn — k?. Exercise: Verify the details 
of this construction. Find explicit formulas for the coordinate charts and their tran- 
sition maps. Carry this over to the real setting. Show that CP” and RP” are special 
cases. 


Example 2.8.9 (The real line with two zeros) A topological space M is called 
Hausdorff iff any two points in M can be separated by disjoint open neighbor- 
hoods. This example shows that a manifold need not be a Hausdorff space. Consider 
the quotient space 


M :=Rx{0,})/= 


where [x, 0] = [x, 1] for x 4 0. An atlas on M consists of two coordinate charts 
goo : Up > Rand ¢; : U; > R where 


Ui:= thx i]|[xeR}, — Gi(lx, iP i= x 


for i = 0,1. Thus M is a 1-manifold. But the topology on M is not Hausdorff, 
because the points [0, 0] and [0, 1] cannot be separated by disjoint open neighbor- 
hoods. 


Example 2.8.10 (A 2-manifold without a countable atlas) Consider the vector 
space X = R x R? with the equivalence relation 


either yj = yo FO, +X = bh + X22 


(41,1, ¥1] = [t2,X2, 2] <= 
or y1 y2 0, ty to, X1 X2-. 


For y 4 0 we have [0, x, y] = [t,x —t/y, y], however, each point (x, 0) on the x- 
axis gets replaced by the uncountable set R x {(x, 0)}. Our manifold is the quotient 
space M := X/ =. This time we do not use the quotient topology but the topology 
induced by our atlas (see Definition 2.8.2). The coordinate charts are parametrized 
by the reals: for t € R the set U; C M and the coordinate chart ¢, : U; > R?’ are 
given by 


U, = {[t,x,y]]x,y ER}, — Oi([t.x, y])) := (x,y). 
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A subset U C M is open, by definition, iff ¢;(U M U;) is an open subset of R? 
for every t € R. With this topology each ¢;, is a homeomorphism from U, onto R? 
and M admits a countable dense subset S := {[0, x, y] |x. y € Q}. However, there 
is no atlas on M consisting of countably many charts. (Each coordinate chart can 
contain at most countably many of the points [t,0,0].) The function f : M—> R 
given by f([t,x, y]) := t + xy is smooth and each point [t, 0, 0] is a critical point 
of f with value t. Thus f has no regular value. Exercise: Show that M is a path- 
connected Hausdorff space. 


In Theorem 2.9.12 we will show that smooth manifolds whose topology is Haus- 
dorff and second countable are precisely those that can be embedded in Euclidean 
space. Most authors tacitly assume that manifolds are Hausdorff and second count- 
able and so will we after the end of the present chapter. However before Sect. 2.9.1 
there is no need to impose these hypotheses. 


2.8.2. Smooth Maps and Diffeomorphisms 


Our next goal is to carry over all the definitions from embedded manifolds in Eu- 
clidean space to the intrinsic setting. 


Definition 2.8.11 (Smooth map) Let 


(M, { (ba, Ua) }aeA)s (N, {(Wa, Ved} pcx) 


be smooth manifolds. A map f : M —> N is called smooth iff it is continuous and 
the map 


fou = Wp f ob; : baa f~'(Vp)) > vp (Vp) (2.8.3) 


is smooth for every a € A and every 6 € B. It is called a diffeomorphism iff it is 
bijective and f and f~! are smooth. The manifolds M and N are called diffeo- 
morphic iff there exists a diffeomorphism f : M > N. 


The reader may check that the notion of a smooth map is independent of the atlas 
used in the definition, that compositions of smooth maps are smooth, and that sums 
and products of smooth maps from M to R are smooth. 

Exercise 2.8.12 Let M be a smooth m-dimensional manifold with an atlas 


A = {(ba, Ua) Sued: 


Consider the quotient space 


M := Ute x boa) /~, 


acA 
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where 


(x ~By) So ') =95'0). 
fora, B € A, x € dy (Uy), and y € @g(Uz). Define an atlas on M by 

Uy:= {[a, x] Ee: € bu(Ua)}, oy ({a. x]) = xX 
Prove that M is a smooth m-manifold and that it is diffeomorphic to M. 


Exercise 2.8.13 Prove that CP! is diffeomorphic to S?. Hint: Stereographic pro- 
jection. 


2.8.3 Tangent Spaces and Derivatives 


In the situation where M is a submanifold of Euclidean space and p € M we have 
defined the tangent space of M at p as the set of all derivatives y(0) of smooth 
curves y : R — M that pass through p = y(0). We cannot do this for manifolds 
in the intrinsic sense, as the derivative of a curve has yet to be defined. In fact, the 
purpose of introducing a tangent space of M is precisely to allow us to define what 
we mean by the derivative of a smooth map. There are two approaches. One is to 
introduce an appropriate equivalence relation on the set of curves through p and the 
other is to use local coordinates. 


Definition 2.8.14 Let M be a manifold with an atlas A = {(¢y, Ug) }ye4 and let 
p € M. Two smooth curves yo, vy) : R > M with yo(0) = y, (0) = p are called 
p-equivalent iff for some (and hence every) a € A with p € Uy we have 


d 


d 
dt 9 OO) _ dt ft). 


We write yo A y, iff yo is p-equivalent to y,; and denote the equivalence class of 
a smooth curve y : R > M with y(0) = p by [y],. Every such equivalence class 
is called a tangent vector of M at p. The tangent space of M at p is the set of 
equivalence classes 


T,M := {[y],|y : R > M is smooth and y(0) = p}. (2.8.4) 


Definition 2.8.15 Let M be a manifold with an atlas A = {(¢y, Ua) }uea and let 
p € M. The A-tangent space of M at p is the quotient space 


TAM := | J{a}x R" /E, (2.8.5) 
peug 
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where the union runs over all aw € A with p € U, and 


(a, &) © (B, n) = d($p 0, ')\(x)E=n, x= da(p). 


Each equivalence class [@, &], is called a tangent vector of M at p. 


In Definition 2.8.14 it is not immediately obvious that the set 7, M in (2.8.4) is 
a vector space. However, the quotient space TAM in (2.8.5) is obviously a vector 
space of dimension m and there is a natural bijection 


d 
T,M > TAM : [y|)p E 5 alr) | ; (2.8.6) 
t=0 Pp 


This bijection induces a vector space structure on the set 7M. In other words, 
the set T,M in (2.8.4) admits a unique vector space structure such that the map 
T,M > TAM in (2.8.6) is a vector space isomorphism. 


Exercise 2.8.16 Verify the phrase “and hence every” in Definition 2.8.14 and de- 
duce that the map 7, M > TAM in (2.8.6) is well defined. Show that it is bijective. 


From now on we will use either Definition 2.8.14 or Definition 2.8.15 or both, 
whichever way is most convenient, and drop the superscript A. 


Definition 2.8.17 (Derivative of a smooth curve) For each smooth curve 
y:R—M with y(O) = p we define the derivative y(0) € T,M as the equiv- 
alence class 


; d 
10) = p= [a 5 


alr) eT,M. 
=0 p 


t 


Definition 2.8.18 (Derivative of a smooth map) Let f : M — N bea smooth 
map between two smooth manifolds (M, {(¢a, Ux)}aea) and (N, {(We. Vp) pez) 
and let p € M. The derivative of f at p is the map 


df(p) : T,M > Ty(p)N 
defined by the formula 


af (P)ly |p = Lf ovr (2.8.7) 


for each smooth curve y : R > M with y(0) = p. Here we use (2.8.4). Under the 
isomorphism (2.8.6) this corresponds to the linear map 


Af (P)a, Elp = [B.dfpalXElyp), X= bap), (2.8.8) 


fora € A with p € U, and B € B with f(p) € Vg, where fg is given by (2.8.3). 
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Remark 2.8.19 Think of N = R” as a manifold with a single coordinate chart, 
namely the identity map wg = id: R” — R". For every g € N = R" the tangent 
space 7, N is then canonically isomorphic to R” via (2.8.5). Thus for every smooth 
map f : M — R" the derivative of f at p € M is alinear map df(p):T,M — R", 
and the formula (2.8.8) reads 


df (py[a, Elp = d(f ogy (XE, X= hal). 


This formula also applies to maps defined on some open subset of M. In particular, 
with f = dy : Uy > R” we have 


dgba(p)la.élp = &. 


Thus the map d¢,(p) : T,M — R” is the canonical vector space isomorphism 
determined by a. 


With these definitions the derivative of f at p is a linear map and we have the 
chain rule for the composition of two smooth maps as in Theorem 2.2.14. 


2.8.4 Submanifolds and Embeddings 


Definition 2.8.20 (Submanifold) Let M be a smooth m-manifold and let n € 
{0,1,...,m}. A subset N C M is called an n-dimensional submanifold of M 
iff, for every p € N, there exists a local coordinate chart 6: U > 92 for M, 
defined on an an open neighborhood U Cc M of p and with values in an open 
set 2 C R" x R”™™”, such that P(U N N) = 22 (R" x {0}). 


By Theorem 2.1.10 an m-manifold M C R* in the sense of Definition 2.1.3 is a 
submanifold of R* in the sense of Definition 2.8.20. By Theorem 2.3.4 the notion 
of a submanifold N C M of a manifold M C R* in Definition 2.3.1 agrees with 
the notion of a submanifold in Definition 2.8.20. 


Exercise 2.8.21 Let N be a submanifold of M@. Show that if M is Hausdorff, so 
is N, andif M is paracompact, so is NV. 


Exercise 2.8.22 Let N be a submanifold of M and let P be a submanifold of NV. 
Prove that P is a submanifold of M. Hint: Use Theorem 2.1.10. 


Exercise 2.8.23 Let N be a submanifold of M. Prove that there exists an open 
set U C M such that N C U and N is closed in the relative topology of U. 


All the theorems we have proved for embedded manifolds and their proofs carry 
over almost word for word to the present setting. For example we have the inverse 
function theorem, the notion of a regular value, the notions of a submersion and of 
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an immersion, the notion of an embedding as a proper injective immersion, and the 
fact from Theorem 2.3.4 that a subset P C M is a submanifold if and only if it is 
the image of an embedding. 


Exercise 2.8.24 (Lines in Euclidean space) The tangent bundle of the 2-sphere 
is the 4-manifold 


TS’ = {@.y) €R’ xR? | jz] = 1, G9) =0} 


(see Example 2.2.6). Define an equivalence relation on TS” by 


def 
(x,y) ~ @',y’) —> xs=4+x, y'=y 


for (x, y), (x’, y’) € TS?. Show that the quotient space TS?/~ can be identified 
with the set L of all lines in R*, by assigning to each pair (x, y) € TS? the line 
Le = {y + tx | ¢ € R} C R?. Show that the space L of lines in R* admits the 
unique structure of a smooth manifold such that the canonical projection TS? > 
L:(x, y)}> x.y is a submersion. Show that the manifold topology on L agrees with 
the quotient topology on 7S?/~. Show that the map L > RP? x R?: Ly y b> ([x], y) 
is an embedding. 


Example 2.8.25 (Veronese embedding) The map 
CP* > CP: [Z0 : 21 3 Za] [z? : ze : zi 1 2Z1Z2 : Z2Z0 : 2021] 


is an embedding. (Exercise: Prove this.) It restricts to an embedding of the real 
projective plane into RP° and also gives rise to embeddings of RP? into R* as well 
as to the Roman surface: an immersion of RP? into R3. (See Example 2.1.17.) There 
are similar embeddings 


cP'scpy-! Nes ( ¥ a) 


for all n and d, defined in terms of monomials of degree d inn + 1 variables. These 
are the Veronese embeddings. 


Example 2.8.26 (Pliicker embedding) The Grassmannian G(IR*) of 2-planes in 
IR* is asmooth 4-manifold and can be expressed as the quotient of the space F2(R*) 
of orthonormal 2-frames in R* by the orthogonal group O(2). (See Example 2.8.8.) 
Write an orthonormal 2-frame in R* as a matrix 


Xo Yo 
pal 7). pp =4, 
X22 
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Then the map f : G)(IR*) > RP”, defined by 

F(([P]) := [Poi : Po2 : Poz : P23? P31: Pir); Dij = Xi Vj — Xj Vi, 
is an embedding and its image is the quadric 


X := f(G(R*) = {p € RP? | poi p23 + Porp31 + Pos P12 = 0}. 


(Exercise: Prove this.) There are analogous embeddings 
n N-1 n 
f : G(R”) > RP*™, nm (7), 


for all k and n, defined in terms of the k x k-minors of the (orthonormal) frames. 
These are the Pliicker embeddings. 


2.8.5 Tangent Bundle and Vector Fields 


Let M be a m-manifold with an atlas A = {(¢y, Uw)},< 4. The tangent bundle of 
M is defined as the disjoint union of the tangent spaces, i.e. 


TM := \_){p} x T,M = {(p,v)|p € M, ve T,M}. 
peM 


Denote by x : TM — M the projection given by z(p, v) := p. Recall the notion 
of a submersion as a smooth map between smooth manifolds, whose derivative is 
surjective at each point. 


Lemma 2.8.27 The tangent bundle of M is a smooth 2m-manifold with coordinate 
charts 


Gy i Ug = 0 '(Uy) > ba(Ua) XR", by (Pp. v) = (bap), dba (pv). 


The projection x : TM — M is a surjective submersion. If M is second countable 
and Hausdorff, so is TM. 


Proof For each pair a, B € A the set 
bu(T a Ug) = baa M Up) x R” 


is open in R” x R” and the transition map 


Ppa = $5 °bg 2 bya Us) > Opa Ug) 
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is given by 
DP palxs8) = (pal), dPpa(x)é) 


for x € dy (Ug N Ug) and — € R” where 
Ppa i= op Py’ 


Thus the transition maps are all diffeomorphisms and so the coordinate charts by 
define an atlas on TM. The topology on TM is determined by this atlas via Defi- 
nition 2.8.2. If M has a countable atlas, so does TM. The remaining assertions are 
easy exercises. Oo 


Definition 2.8.28 Let M be a smooth m-manifold. A (smooth) vector field on 
M is acollection of tangent vectors X(p) € T,M, one for each point p € M, such 
that the map M > TM : p ® (p, X(p)) is smooth. The set of all smooth vector 
fields on M will be denoted by Vect(M). 


Associated to a vector field is a smooth map M — TM whose composition 
with the projection z : TM — M is the identity map on M. Strictly speaking this 
map should be denoted by a symbol other than X, for example by X. However, it is 
convenient at this point, and common practice, to slightly abuse notation and denote 
the map from M to TM also by X. Thus a vector field can be defined as a smooth 
map 


X:M—>TM 
such that 
woX =id:M—-M. 


Such a map is also called a section of the tangent bundle. 
Now suppose A = {(¢y, Ux) }ye4 is an atlas on M and X : M — TM isa vector 
field on M. Then X determines a collection of smooth maps 


Xu: ba(Uy) > R” 
given by 
Xa(x) = dba(p)X(p), p:= ¢, (x), (2.8.9) 


for x € ¢,(U,). We can think of each X,, as a vector field on the open set ¢,(Uy) C 
IR”, representing the vector field X on the coordinate patch U,. These local vector 
fields X, satisfy the condition 


Xp (Ppa(x)) = dbpalx)Xa(x) (2.8.10) 
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for x € ¢y(Uy N Ug). This equation can also be expressed in the form 


XelgaUants) = PpaX bldg Vang): (2.8.11) 


Conversely, any collection of smooth maps Xy : ¢y(Uy) > R” satisfying (2.8.10) 
determines a unique vectorfield X on M via (2.8.9). Thus we can define the Lie 
bracket of two vector fields X, Y € Vect(M) by 


[X, Yu(x) == [Ney Yal(x) = dXu(%x)¥a(x) — dYa(x)Xu(x) (2.8.12 


fora € A and x € ¢,(U,). It follows from equation (2.4.18) in Lemma 2.4.21 that 
the local vector fields 


LX, Ya : Pa(Ua) > R” 
satisfy (2.8.11) and hence determine a unique vector field [X, Y] on M via 


[X, Y](p) := dba(p) [Xa Yal(Ga(p)), pp € Us. (2.8.13) 


Thus the Lie bracket of X and Y is defined on U, as the pullback of the Lie bracket 
of the vector fields X, and Y, under the coordinate chart ¢,. With this understood 
all the results in Sect. 2.4 about vector fields and flows along with their proofs carry 
over word for word to the intrinsic setting whenever M is a Hausdorff space. This 
includes the existence and uniquess result for integral curves in Theorem 2.4.7, 
the concept of the flow of a vector field in Definition 2.4.8 and its properties in 
Theorem 2.4.9, the notion of completeness of a vector field (that the integral curves 
exist for all time), and the various properties of the Lie bracket such as the Jacobi 
identity (2.4.20), the formulas in Lemma 2.4.18, and the fact that the Lie bracket 
of two vector fields vanishes if and only if the corresponding flows commute (see 
Lemma 2.4.26). One can also carry over the notion of a subbundle EF Cc 7M of 
rank 7 to the intrinsic setting by the condition that EF is a smooth submanifold 
of TM and intersects each fiber 7, M in an n-dimensional linear subspace 


E, = {v €T,M |(p,v) € E}. 


Then the characterization of subbundles in Theorem 2.6.10 and the theorem of 
Frobenius 2.7.2 including their proofs also carry over to the intrinsic setting when- 
ever M is a Hausdorff space. 


2.8.6 Coordinate Notation 


Fix a coordinate chart ¢, : Uy > R” on an m-manifold M. The components of dy 
are smooth real valued functions on the open subset U, of M and it is customary to 
denote them by 


xt. x” 2 Uy > R. 
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The derivatives of these functions at p € Uy, are linear functionals 
dx'(p):T,M >R, i=l,...,m. (2.8.14) 
They form a basis of the dual space 
T;M := Hom(7,M,R). 


(A coordinate chart on M can in fact be characterized as an m-tuple of real valued 
functions on an open subset of M whose derivatives are everywhere linearly inde- 
pendent and which, taken together, form an injective map.) The dual basis of 7, M 
will be denoted by 


0 0 
—(p),..., —(p) € T,M. 2.8.15 
5x1 (P) 5m \P) < Tp ( ) 
Thus 

0 1, ifi=j 

dx' (p)—(p) = 5 = 9 ; 

WPF HF8 = 99 aes ae 
fori, j =1,...,m and 0/dx’ is a vector field on the coordinate patch U,,. For each 


p € U, it is the canonical basis of 7, M determined by ¢q. In the notation of (2.8.5) 
and Remark 2.8.19 we have 


0 = 
sai (P) = [a €i|p = dal) 'es, 
x 
where e; = (0,...,0,1,0,...,0) (with 1 in the ith place) denotes the standard 
basis vector of R”. In other words, for all € = (€!,...,€") € R™ and all p € Uy, 
the tangent vector 
v := dby(p) 'E € T,M 
is given by 
m ; a 
v= a, fp = Df (0). (2.8.16) 
i=l 7 
Thus the restriction of a vector field X € Vect(M) to U, has the form 
m : a 
Xluy, = ‘—_, 
lu. ZF ox! 
where £!,...,€” : U, — R are smooth real valued functions. If the map 


Xa: ba(Ua) > R” 
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is defined by (2.8.9), then 


Xn OG, = Eve E™). 


The above notation is motivated by the observation that the derivative of a smooth 
function f : M — R in the direction of a vector field X on a coordinate patch Uy, 
is given by 


m ; of 
£ = er 
xf lus § ee 


Here the term 0f/dx' is understood as first writing f as a function of x!,...,x’", 
then taking the partial derivative, and afterwards expressing this partial derivative 
again as a function of p. Thus 0f/dx' is the shorthand notation for the function 


(s4(f 06;')) 0 ba: Us > R. 


2.9 Consequences of Paracompactness* 


In geometry it is often necessary to turn a construction in local coordinates into a 
global geometric object. A key technical tool for such “local to global” construc- 
tions is an existence theorem for partitions of unity. 


2.9.1 Paracompactness 


The existence of a countable atlas is of fundamental importance for almost ev- 
erything we will prove about manifolds. The next two remarks describe several 
equivalent conditions. 


Remark 2.9.1 Let M be a smooth manifold and denote by 
Usc2” 


the topology induced by the atlas as in Definition 2.8.2. Then the following are 
equivalent. 


(a) M admits a countable atlas. 

(b) M is o-compact, i.e. there is a sequence of compact subsets K; C M such that 
K; C int(K;+,) for every i € N and M = (J, Ki. 

(c) Every open cover of M has a countable subcover. 

(d) M is second countable, i.e. there is a countable collection of open sets B C U 
such that every open set U € U is a union of open sets from the collection B. 
(B is then called a countable base for the topology of M.) 
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That (a) (b) (c) (a) and (a) => (d) follows directly from the definitions. 
The proof that (d) implies (a) requires the construction of a countable refinement 
and the axiom of choice. (A refinement of an open cover {U;};<7 is an open cover 
{Vj$jez such that each set V; is contained in one of the sets Uj.) 


Remark 2.9.2 Let M and U be as in Remark 2.9.1 and suppose in addition that M 
is aconnected Hausdorff space. Then the existence of a countable atlas is also equiv- 
alent to each of the following conditions. 


(e) M is metrizable, i.e. there is a distance function d : M x M — [0,co) such 
that U is the topology induced by d. 

(f) M is paracompact, i.e. every open cover of M has a locally finite refinement. 
(An open cover {V;}j<, is called locally finite iff every p ¢ M has a neighbor- 
hood that intersects only finitely many V;.) 


That (a) implies (e) follows from the Urysohn Metrization Theorem which as- 
serts (in its original form) that every normal second countable topological space is 
metrizable [51, Theorem 34.1]. A topological space M is called normal iff points 
are closed and, for any two disjoint closed sets A, B C M, there are disjoint open 
sets U, V C M such that A C U and B CV. It is called regular iff points are closed 
and, for every closed set A C M and every b € M \ A, there are disjoint open sets 
U,V CM such that ACU andbeV. Itis called locally compact iff, for every open 
set U C M andevery p € U, there is a compact neighborhood of p contained in U. 
It is easy to show that every manifold is locally compact and every locally compact 
Hausdorff space is regular. Tychonoff’s Lemma asserts that a regular topological 
space with a countable base is normal [51, Theorem 32.1]. Hence it follows from 
the Urysohn Metrization Theorem that every Hausdorff manifold with a countable 
base is metrizable. That (e) implies (f) follows from a more general theorem which 
asserts that every metric space is paracompact (see [51, Theorem 41.4] and [62]). 
Conversely, the Smirnov Metrization Theorem asserts that a paracompact Haus- 
dorff space is metrizable if and only if it is locally metrizable, i.e. every point has a 
metrizable neighborhood (see [51, Theorem 42.1]). Since every manifold is locally 
metrizable this shows that (f) implies (e). Thus we have (a) => (e) <=> (f) for 
every Hausdorff manifold. 

The proof that (f) implies (a) does not require the Hausdorff property but we 
do need the assumption that M is connected. (A manifold with uncountably many 
connected components, each of which is paracompact, is itself paracompact but 
does not admit a countable atlas.) Here is a sketch. If M is a paracompact manifold, 
then there is a locally finite open cover {Uw tve4 by coordinate charts. Since each 
set U, has a countable dense subset, the set {a € A| Uy M Ug, 4 O} is at most 
countable for each a € A. Since M is connected we can reach each point from Uy, 
through a finite sequence of sets Uy,,..., Ug, with Ug,_, Ug, # @. This implies 
that the index set A is countable and hence M admits a countable atlas. 
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Remark 2.9.3. A Riemann surface is a 1-dimensional complex manifold (i.e. 
the coordinate charts take values in C and the transition maps are holomorphic) 
with a Hausdorff topology. It is a deep theorem in the theory of Riemann surfaces 
that every connected Riemann surface is necessarily second countable (see [2]). 
Thus pathological examples of the type discussed in Example 2.8.10 cannot be 
constructed with holomorphic transition maps. 


Exercise 2.9.4 Prove that every manifold is locally compact. Find an example of 
a manifold M and a point po € M such that every closed neighborhood of po is 
non-compact. Hint: The example is necessarly non-Hausdorff. 


Exercise 2.9.5 Prove that a manifold M admits a countable atlas if and only if it 
is o-compact if and only if every open cover of M has a countable subcover if and 
only if it is second countable. Hint: The topology of IR” is second countable and 
every open subset of R” is o-compact. 


Exercise 2.9.6 Prove that every submanifold M C R* (Definition 2.1.3) is second 
countable. 


Exercise 2.9.7 Prove that every connected component of a manifold M is an open 
subset of M and is path-connected. 
2.9.2 Partitions of Unity 


Definition 2.9.8 (Partition of unity) Let WM be a smooth manifold. A partition 
of unity on M is a collection of smooth functions 


6, : M — [0,1], aéA, 


such that each point p € M has an open neighborhood V C M on which only 
finitely many @, do not vanish, i.e. 


#{a € A| Oly #0} < 00, (2.9.1) 


and, for every p € M, we have 


Sela) = 1. (2.9.2) 


acd 


If {Uy} ve4 18 an open cover of M, then a partition of unity {O,},.4 (indexed by the 
same set A) is called subordinate to the cover iff each 0, is supported in Ug, i.e. 


supp(6a) := {p € M| 6a(p) # 0} C Uy. 
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Theorem 2.9.9 (Partitions of unity) Let M be a smooth manifold whose topology 
is paracompact and Hausdorff. Then, for every open cover of M, there exists a 
partition of unity subordinate to that cover. 


The proof requires two preparatory lemmas. 


Lemma 2.9.10 Let M be a smooth manifold with a Hausdorff topology. Then, for 
every open set V C M and every compact set K C V, there exists a smooth func- 
tion kK : M — [0,00) with compact support such that k(p) > 0 for every p€ K 
and supp(k) C V. 


Proof Assume first that K = {po} is a single point. Since M is a manifold it is 
locally compact. Hence there is a compact neighborhood C C V of po. Since M 
is Hausdorff C is closed and hence the set U := int(C) is a neighborhood of po 
whose closure U C C is compact and contained in V. Shrinking U, if necessary, we 
may assume that there is a coordinate chart @ : U — §2 with values in some open 
neighborhood 2 C R” of the origin such that (po) = 0. (Here m is the dimen- 
sion of M.) Now choose a smooth function kg : 2 — [0, co) with compact support 
such that Ko(0) > 0. Then the function x : M — (0, 1], defined by k|y := Koo @ 
and «(p) :=0 for p ¢ M\U is supported in V and satisfies k(po) > 0. This 
proves the lemma in the case where K is a point. 

Now let K be any compact subset of V. Then, by the first part of the proof, 
there is a collection of smooth functions k, : M — [0, 00), one for every p € K, 
such that «,(p) > 0 and supp(«,) C V. Since K is compact there are finitely many 
points p;,..., px € K such that the sets {p € M |ky,(p) > o} cover K. Hence 
the function k := )° ; Kp; 18 Supported in V and is everywhere positive on K. This 
proves Lemma 2.9.10. O 


Lemma 2.9.11 Let M be a topological space. If {V;},-, is a locally finite collec- 


tion of open sets in M, then 


ie] 


Uv-ur 


i€lo 1El1o 


for every subset Ig C I. 


Proof The set Uje;, V;_ is obviously contained in the closure of ();. ig Vie 


To prove the converse choose a point po € M\U;. iG V;. Since the collec- 
tion {Vi}ier is locally finite, there exists an open neighborhood U of po such 
that the set J; := {i ¢ 1 | V; NU F G} is finite. Hence the set 


U:=U\ VU Vi 
ielgnh 


is an open neighborhood of po and we have Up N V; = @ for every i € Ip. Hence 
Po € Use Vi- This proves Lemma 2.9.11. Oo 
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Proof of Theorem 2.9.9 Let {Uy} ye4 be an open cover of M. We prove in four 
steps that there is a partition of unity subordinate to this cover. The proofs of steps 
one and two are taken from [51, Lemma 41.6]. 


Step 1 There is a locally finite open cover {Vj}; <1 of M such that, for every i € I, 
the closure V; is compact and contained in one of the sets Uy. 


Denote by V Cc 2™ the set of all open sets V C M such that V is compact and 
V CU, for some a € A. Since M is a locally compact Hausdorff space the collection 
V is an open cover of M. (If p € M, then there is ana € A such that p € Uy; 
since M is locally compact there is a compact neighborhood K C Uy of p; since 
M is Hausdorff K is closed and thus V := int(K) is an open neighborhood of p 
with V C K C U,.) Since M is paracompact the open cover V has a locally finite 
refinement {V;};<,. This cover satisfies the requirements of Step 1. 


Step 2 There is a collection of compact sets K; C V;, one for eachi € I, such that 
M = User Ki. 


Denote by W Cc 2” the set of all open sets W C M such that W C V; for some i. 
Since M is a locally compact Hausdorff space, the collection W is an open cover 
of M. Since M is paracompact W has a locally finite refinement {W;};<7. By the 
axiom of choice there is a map 

Jolijri 
such that 


Wicv, Vied. 


Since the collection {W;}j<, is locally finite, we have 


ij=i ij=i 
by Lemma 2.9.11. Since V; is compact so is Kj. 

Step 3 There is a partition of unity subordinate to the cover {V; hic. 

Choose a collection of compact sets K; C V; for i € I as in Step 2. Then, by 
Lemma 2.9.10 and the axiom of choice, there is a collection of smooth functions 


k; : M — [0,00) with compact support such that 


supp(k;) C Vj, Kilk;, > 90 Viel. 
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Since the cover {V;};<, is locally finite the sum 
i= YK: :M—->R 
ie] 


is locally finite (i.e. each point in M has a neighborhood in which only finitely 
many terms do not vanish) and thus defines a smooth function on M. This function 
is everywhere positive, because each summand is nonnegative and, for each p € M, 
there is ani € J with p € K; so that «;(p) > 0. Thus the funtions y; := «;/k define 
a partition of unity satisfying supp(y;) C V; for every i € J as required. 


Step 4 There is a partition of unity subordinate to the cover {Uy}aeA. 


Let {yi hic; be the partition of unity constructed in Step 3. By the axiom of choice 
there isa map J > A:it> a; such that V; C Uy, for every i € J. Fora € A define 
6, : M — [0, 1] by 


Ay = > Xi- 


Here the sum runs over all indices i € J with a; = a. This sum is locally finite 
and hence is a smooth function on M. Moreover, each point in M has an open 
neighborhood in which only finitely many of the 6, do not vanish. Hence the sum 
of the 6, is a well defined function on M and 


Y= >>> w= >on Hl. 


acA acd aj=a iel 


This shows that the functions 6, form a partition of unity. To prove the inclusion 
supp(6,) C Uy we consider the open sets 


Wi :={p €M|xi(p) > 0} 


fori € I. Since W; C JV; this collection is locally finite. Hence, by Lemma 2.9.11, 
we have 


supp) = L) W = LJ Wi = LU supp) c LU ic Ua. 


aj=a aj=a aj=a aj=a 


This proves Theorem 2.9.9. O 


2.9.3. Embedding in Euclidean Space 


Theorem 2.9.12 Let M be a second countable smooth m-manifold with a Haus- 
dorff topology. Then there exists an embedding f : M — R*"*! with a closed 
image. 
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Step 1 Let U C M be an open set and let K C U be a compact set. Then there 
exists an integer k € N, a smooth map f : M — R*, and an open set V C M, 
such that K CV CU, the restriction f|y : V — R* is an injective immersion, 
and f(p) = Ofor all pe M \ U. 


Choose a smooth atlas A = {(¢9, Ux)}we4 on M such that, for each a € A, either 
Uy, CU or UN K = @. Since M is a paracompact Hausdorff manifold, Theo- 
rem 2.9.9 asserts that there exists a partition of unity {6,}v<4 subordinate to the 
open cover {Uytaca of M. Since the sets {p € Uy | Oa(p) > 0} with U, Cc U form 
an open cover of K and K is a compact subset of M, there exist finitely many 
indices a@,,...,a@¢ € A such that 


KC {pe M | 6q,(p) +--- + G,(p)) > Oh} = V CU. 
Let k := €(m + 1) and, fori = 1,..., £, abbreviate 
$1 = $a, = Boy. 
Define the smooth map f : M > R* by 


al (p) 
A (p)bi(p) 


f(p) = : for p € M. 
(Pp) 
8e(p)be(P) 


Then the restriction f|y : V > R* is injective. Namely, if po, p1 € V satisfy 


S (po) = f(p1), 


then 
I := {i | (po) > 0} = {i | %(p1) > 0} 4B 


and, fori € I, we have 6;(po) = 9 (pi), hence $; (po) = ¢;(p1), and so po = Pi. 
Moreover, for every p € V the derivative df(p) : T,M > Ré is injective, and this 
proves Step 1. 


Step 2 Let f : M — R* be an injective immersion and let A C RO”+)xk 
be a nonempty open set. Then there exists a matrix A € A such that the map 
Af : M — R*"*! is an injective immersion. 
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The proof of Step 2 uses the Theorem of Sard (see [1, 50]). The sets 


Wo := (9.9) € Mx M | p #q}, 
W, := {(p,v) € TM |v £0} 


are open subsets of smooth second countable Hausdorff 2m-manifolds and the maps 
Fo: AxWo>R™t, Fi: AxWoR”, 
defined by 
Fo(A, p,q) = A(f(p)— fq), Fi(A, pv) := Adf(p)v 
for A € A, (p,q) € Wo, and (p,v) € W,, are smooth. Moreover, the zero vector 
in R7”*! is a regular value of Fy because f is injective and of F, because f is an 


immersion. Hence it follows from the intrinsic analogue of Theorem 2.2.19 that the 
sets 


Mo := Fy"'(0) = {(A, p,q) € Ax Wo| Af(p) = AF@}. 
My = F,'(0) = {(A, pv) € Ax W, | Adf(p)v = 0} 


are smooth manifolds of dimension 
dim Mo = dim ™, = (2m + 1)k —1. 


Since M is a second countable Hausdorff manifold, so are Mo and ™,. Hence the 
Theorem of Sard asserts that the canonical projections 


My > A: (A, p,q) > A =: m0(A, Pg), 
M, > A:(A,p,v) Rh A=: 71(A, p,v), 


have a common regular value A € A. Since 
dim My = dim ™, < dim A, 
this implies 
A€ A \ (m0(Mo) U m(™))). 
Hence Af : M — R?"*! is an injective immersion and this proves Step 2. 
If M is compact, the result follows from Steps 1 and 2 with K = U = M. In 


the noncompact case the proof requires two more steps to construct an embedding 
into R“”** and a further step to reduce the dimension to 2m + 1. 
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Step 3 Assume M is not compact. Then there exists a sequence of open sets U; C M, 
a sequence of smooth functions p; :M — [0,1], and a sequence of compact 
sets K; C U; such that 


supp(p;) C U;, K; = p;'(1) c U;, U,NU; = 


foralli, j €N with |i — j|>2andM =, Ki. 


Since M is second countable, there exists a sequence of compact sets C; C M such 
that C; C int(C;,) for alli ¢ N and M = Ld ae C; (Remark 2.9.1). Define the 
compact sets B; C M by Co := @ and 


B; = C; \ C4 fori Ee N. 
Then M = Uren B; and, for alli, 7 ¢ N with j >i + 2, we have 
B; Cc C; Cc int(Cj_1), B; Gc C \ int(C;_1) 


and so B} NB; = ®. Since M is metrizable by Remark 2.9.2, there exists a distance 
function d : M x M = (0, oo) that induces the intrinsic topology on M. Define 


Aj = e, B; 6; = d(A;, B)) = joi d(p,q). 
jEeN\f{i-Lii+) 


Then A; is a closed subset of M, because any convergent sequence in M must 
belong to a finite union of the B;. Since A; N B; = 9, this implies ¢; > 0. Fori ¢ N 
define the set U; C M by 


Uj, := {p eM | there exists aq € B; with d(p,q) < 6; /3}. 


Then {U;};cn is a sequence of open subsets of M such that B; C U; C C;+, for 
alli ¢ N and U; NU; = @ for |i — j| = 2. In particular, each set U; has a compact 
closure. 

For each 7 there exists of a partition of unity subordinate to the open cover 
M =U; U(M \ B;) and hence a smooth function p; : M — [0,1] such that 
supp(p;) C U; and p;|z, = 1. Define K; := p;'(1) = {p € U; | p;(p) = 1} for 
i ¢ N. Then K; is a compact set and B; C K; C U; for each i € N. Hence 
M = U,en Ki and this proves Step 3. 


Step 4 Assume M is not compact. Then there exists an embedding 
ha -_Ms> Ramt4 


with a closed image and a pair of orthonormal vectors x, y € R4*"** such that, for 
every € > 0, there exists a compact set K C M with 


inf S(p) 


sup —sx—ty| <e. (2.9.3) 
pem\K steR| | f (p)| 
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Assume M is not compact and let K;, U;,; be as in Step 3. Then, by Steps 1 
and 2, there exists a sequence of smooth maps g; : M — R?”*! such that 
gilm\u, = 9, the restriction g;|x, : Ki > R2"+! js injective, and the derivative 
dgi(p) : TyM — R"*! is injective for all p € K; and alli e¢ N. Let € ¢ R?”*! 
be a unit vector and define the maps f; : M — R?”*! by 


Si(p) -= pi(p)| iE + esa 2 (2.9.4) 


1+ |gi(p)? 


for p € M andi €N. Then the restriction f;|x, : Kj > R?”*! is injective, the 
derivative df;(p) : T,M — R"*! is injective for all p € K;, and 


supp(fi) C Uj, Si(Ki) C Bi (7), Si(M) C Bi+1(0). 


Define the maps f%*, f° : M — R?”+! and p4, oY : M > R by 


p4(p) = | ag ifi ¢ N and p € Up;_-1, 


0, if p € M \ Usen Uai-i. 
f(y) — Soi-1 (Pp), ifi e N and p € Udi-1, 
0, if p € M \ U;en Voi-s, 
0 (p) = poi(p), ifi ¢ N and p € Uy;, 
0, if p € M \ Usen Uri, 
fp) = hoi (Dp), ifi e N and p € U;, 
: 0, ifp € M \ Usen Vai. 


and define the map f : M > R*”*+4 by 


IDAHO? OF OO. OD) 


forpe M. 

We prove that f is injective. To see this, note that 
2i -2< | f4(p)| < 23, 
[f(p)| < 20 + 1, 


2i=1 =| f"(p)| < 27 +1, 
| fo%(p)| < 2i + 2, 


Pp & Kaj-1 => | 
(2.9.5) 
pe K; => | 


Now let po, p1 € M such that f(po) = f(p1). Assume first that po € K>;_1. 
Then p™4(p,) = p%4(po) = 1 and hence p; € Uien K;_1. By (2.9.5), we also 
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have 2i —2 < | f°4(p,)| = | f°“4(po)| < 2i and hence p, € K>;_1. This implies 
fi-r(Py) = f°" (pid = f°"(po) = fri-1(Po) and so po = pi. Now assume 
Po € Ko;. Then p*’(p1) = p*’(po) = 1 and hence p; € icN K;. By (2.9.5), we 
also have 2i — 1 < | f*¥(p,)| = | f° (po)| < 2i + 1, so py € K2;, which implies 
fri (Di) = fpr) = f°"(po) = fri (Po), and so again py = pi. This shows that f 
is injective. That f is an immersion follows from the fact that the derivative df; (p) 
is injective for all p € Kj; andalli ¢ N. 

We prove that f is proper and has a closed image. Let (p)),en be a sequence 
in M such that the sequence (f(p)))ven in R*”** is bounded. Choose i € ay — 
that | f°44(p,)| < 2i and | f°(p,)| < 2i + 1 for all v € N. Then p, € We 
for all v € N by (2.9.5). Hence (p,)ven has a convergent subsequence. ae 
f : M — R*"*+* is an embedding with a closed image. 

Next consider the pair of orthonormal vectors 


= (0, £,0,0), y := (0,0,0, &) 


in R4#"+4 = R x R41 x R x R?”"*!, Let (p,),en be a sequence in M that 
does not have a convergent subsequence and choose a sequence i, € N such 
that p, € Ko;,-1 U K;, for all v €¢ N. Then 7, tends to infinity. If p, € Koj,-1 
for all v, then we have limsup,,,., | f(p,)|7'|f(py)| < 1 by (2.9.5). Pass- 
ing to a subsequence, still denoted by (p),en, we may assume that the limit 
A = Timy 500 |f (ft) "| fF (py)| exists. Then 


odd f 1 ev 7 A 
iat, OO). mn OI _ 


, lim = : 
v>oo | f(pv)| VI+A2 vo | f(py)| V1 42? 
and it follows from (2.9.4) that 


a fe) S° (Dp) 
fim ae = 9. 
ae [Fem A Temp, y] AF 
This implies 
_ f(Py) ( E rE ) 1 i. 
; ={6, 0: = 
ve [f(py)| Vick Vite) View’ ieee 


Similarly, if p, € Ko;, for all v, there exists a subsequence such that the limit 
A := limy-sc0 | f (pv)! | f°“ (p,)| exists and, by (2.9.4), this implies 


ee Fee = (0 a )- A, _! ; 
ve | f(py)| "JTF 22 JT 422 JI#22  VJ/12 22 


This shows that the vectors x and y satisfy the requirements of Step 4. 
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Step 5 There exists an embedding f : M > R?"*! with a closed image. 


For compact manifolds the result was proved in Steps | and 2 and for m = 0 the 
assertion is obvious, because then M is a finite or countable set with the discrete 
topology. Thus assume that M is not compact andm > 1. Choose f : M > R*"*+4 
and x, y € R*”"*4 as in Step 4 and define 


the vectors Ax and Ay 


Ae Rem+)xdm+4) ; 
are linearly independent 


A:= 


Since m > 1, this is a nonempty open subset of R@”+)*G"+4)_ We prove that the 
map Af : M — R?”*! is proper and has a closed image for every A € A. To see 
this, fix a matrix A € A. Let (p,),en be a sequence in M that does not have a 
convergent subsequence. Then by Step 4 there exists a subsequence, still denoted 
by (py) ven, and real numbers s,¢ € R such that 


r+rP=1, lim HP») =sx+ty, lim | f(py)| = ©. 
v+oe | f(py)| ve 
This implies 
Af (pv) 


im = sAx+tAy £0 
vee | f(py)| 


and hence lim, .../A/f(p,)| = oo. Thus the preimage of every compact sub- 
set of R?”*! under the map Af : M — R””*! is a compact subset of M, and 
hence Af is proper and has a closed image (Remark 2.3.3). 

Now it follows from Step 2 that there exists a matrix A € A such that the 
map Af : M — R?'*! is an injective immersion. Hence it is an embedding with a 
closed image. This proves Step 5 and Theorem 2.9.12. Oo 


The Whitney Embedding Theorem asserts that every second countable Haus- 
dorff m-manifold M admits an embedding f : M — R””. The proof is based on 
the Whitney Trick and goes beyond the scope of this book. The next exercise shows 
that Whitney’s theorem is sharp. 


Remark 2.9.13 The manifold RP? cannot be embedded into R*. The same is 
true for the Klein bottle K := R*/ = where the equivalence relation is given by 
[x,y] = [x +k,€—y] forx,y € Randk,£ € Z. 


2.9.4 Leaves of a Foliation 


Let M be an m-dimensional paracompact Hausdorff manifold and E C TM be 
an integrable subbundle of rank n. Let L C M be a closed leaf of the foliation 
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determined by E. Then L is a smooth n-dimensional submanifold of M. Here is a 
sketch of David Epstein’s proof of this fact in [19]. 


(a) The space L with the intrinsic topology admits the structure of a manifold such 
that the obvious inclusion t: L — M is an injective immersion. This is an easy 
exercise. For the definition of the intrinsic topology see Exercise 2.7.10. The 
dimension of L is n. 

(b) If f : X — Y is a continuous map between topological spaces such that Y 
is paracompact and there is an open cover {V;}j<s of Y such that f~'(V;) is 
paracompact for each j, then X is paracompact. To see this, we may assume 
that the cover {V;}j<, is locally finite. Now let {Uy}ae4 be an open cover of X. 
Then the sets U, M f~'(V;) define an open cover of f~!(V;). Choose a locally 
finite refinement {Wj };<z, of this cover. Then the open cover {W;;}jcu,ier, of 
M isa locally finite refinement of {Uy }ye4. 

(c) The intrinsic topology of L is paracompact. This follows from (b) and the fact 
that the intersection of L with every foliation box is paracompact in the intrinsic 
topology. 

(d) The intrinsic topology of L is second countable. This follows from (a) and (c) 
and the fact that every connected paracompact manifold is second countable 
(see Remark 2.9.2). 

(e) The intersection of L with a foliation box consists of at most countably many 
connected components. This follows immediately from (d). 

(f) If L is a closed subset of M, then the intersection of L with a foliation box has 
only finitely many connected components. To see this, we choose a transverse 
slice of the foliation at pp € L, 1.e. a connected submanifold T C M through po, 
diffeomorphic to an open ball in R””, whose tangent space at each point p € T 
is acomplement of E,,. By (d) we have that TM L is at most countable. If this set 
is not finite, even after shrinking 7’, there must be a sequence p; € (TN L)\ {po} 
converging to po. Using the holonomy of the leaf (obtained by transporting 
transverse slices along a curve via a lifting argument) we find that every point 
p€TO Lis the limit point of a sequence in (J 1 L) \ {p}. Hence the one-point 
set { p} has empty interior in the relative topology of TL foreach pe TOL. 
Thus 7M L is a countable union of closed subsets with empty interior. Since 
T OL admits the structure of a complete metric space, this contradicts the Baire 
category theorem. 

(g) It follows immediately from (f) that L is a submanifold of M. 


2.9.5 Principal Bundles 
An interesting class of foliations arises from smooth Lie group actions. Let 


G Cc GL(N, R) be a compact Lie group and let P be a smooth m-manifold whose 
topology is Hausdorff and second countable. A smooth (contravariant) G-action 
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on P is asmooth map 
PxG>P:(p,g) pg (2.9.6) 


that satisfies the conditions 


(pg)h = p(gh), pl=p (2.9.7) 


for all p € P and all g,h €G. Fix any such group action. Then every group 
element g € G determines a diffeomorphism P — P : p+» pg, whose deriva- 
tive at p € P is denoted by 7,P > T,,P:vt> vg. Every Lie algebra el- 
ement € € g:= Lie(G) = 7;G determines a vector field X; € Vect(P) which 
assigns to each p € P the tangent vector 


d 

X¢(p) = pé:= 7 pexp(té) € T, P. (2.9.8) 
1=0 

The linear map g — Vect(P) : € +» X¢ is called the infinitesimal action. It is a Lie 

algebra anti-homomorphism because the group action is contravariant. (Exercise: 

Prove that [X¢, X,] = —Xjg,9| for €, € g.) The group action (2.9.6) is said to be 

with finite isotropy iff the isotropy subgroup 


Gy, = {g €G| pg = p} 


is finite for all p € P. The isotropy subgroup G, is a Lie subgroup of G with 
Lie algebra g, := Lie(G,) = {é €g|Xe(p) = O}. Since G is compact, this 
shows that G, is a finite subgroup of G if and only if g, = {0} or, equivalently, 
the map g > 7,P:&+> X¢(p) = pé is injective. Thus, in the finite isotropy 
case, the group action determines an involutive subbundle E C TP with the 
fibers E, := pg = {X:(p) |Ee g} for p € P. When G is connected, the leaves of 
the corresponding foliation are the group orbits pG := {pg |g € G}. These are the 
elements of the orbit space 


P/G:= {pG|pe P}. 


There is a natural projection z : P ~ P/G defined by z(p) := pG for p € P and 
the orbit space P/G is equipped with the quotient topology (a subset U C P/G 
is open if and only if z~!(U) is an open subset of P). The group action is called 
free iff G, = {1} for all p € P. The next theorem shows that, in the case of a free 
action, the quotient space admits a unique smooth structure such that the projec- 
tion z : P + P/Gisa submersion. 


Theorem 2.9.14 (Principal Bundle) Let P be a smooth m-manifold whose topol- 
ogy is Hausdorff and second countable. Suppose P is equipped with a smooth 
contravariant action of a compact Lie group G and assume the group action is 
free. Then dim(G) < mand B := P/G admits a unique smooth structure such that 
the projection m : P — B is a submersion. The intrinsic topology of B, induced 
by the smooth structure, agrees with the quotient topology, and it is Hausdorff and 
second countable. 
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Proof For each p € P the map G > P: g + pg is an embedding and this im- 
plies k := dim(G) < dim(P) = m. Define n := m — k. A local slice of the group 
action is a smooth map 1: §2 — P, defined on an open set {2 C R", such that 
the map 2 x G—> P: (x, g)  t(x)g is an embedding. With this understood, we 
prove the assertions in five steps. 


Step 1 For every po € P there exists a local slice tg : S29 — P, defined on an open 
neighborhood S29 C R" of the origin, such that to(0) = Po. 


Choose a coordinate chart ¢ : V — R” on an open neighborhood V C P of po 
such that ¢(po) = 0 and @(V) = R”. Define v1,..., Um, € TP by 


ddb(po)vi = e; fori = 1,...,m, 
where @,...,@» is the standard basis of IR”. Reorder the coordinates on R’, 
if necessary, such that the vectors vj,...,U, project to a basis of the quotient 
space T,, P/ pog. Define i : R" > P by 
ies in Xe) = Oy os Bap Os, 0) 
and define the map y : R” x G > P by 


w(x, g) :=U(x)g forx € R" andg €G. 


Then yw is smooth and its derivative dy (0,1) : R" x g > T,,, P is given by 
dw(0, 1)@,§) = D0 Xv; + pok 
i=l 


for ¥ = (X1,...,%,) € R” and &€ € g. Hence dy(0, 1) is bijective and so it fol- 
lows from the Inverse Function Theorem 2.2.17 that there exist open neighbor- 
hoods §2 C R” of 0, 2; C Gof 1, and W Cc P of po such that the restricted map 


Wi = Wlexa, > 2x 22; —>W 
is a diffeomorphism. 


Next we prove that there exists an open neigborhood 29 C 2 of the origin such 
that the restricted map 


Wo i= Wiaxa : 2 x G— P 
is injective. Suppose otherwise that no such neighborhood {29 exists. Then there 


exist sequences (x;, gj), (x;,g;) € @ x G such that (x;,g;) 4 (x;,g}) and 
w(x, 21) = W(x}, g;) for all i and the sequences (x;)j;en and (x!);en in 2 converge 
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to the origin. Since G is compact we may assume, by passing to a subsequence if 
necessary, that the sequences (g;);en and (g;);cen converge. Denote the limits by 


g:= lim g; €G, g’ := lim g} €G. 
Then 
Pog = lim 1(x;)gi = lim e(x;)g; = pog’ 


and so g = g’ because the group action is free. Thus the sequence (g/g>');en inG 
converges to 1 and hence belongs to the set §2, fori sufficiently large. Since 


Wixi, 1) = e0;) = u(x) gigs! = Wie}, gig7') 


for all 7, this contradicts the injectivity of y,. Thus we have proved that the 
map Wo : 29 x G => P is injective for a suitable neighborhood 2 C @ of the 
origin. That it is an immersion is a direct consequence of the formula 


dwWo(x, g)\(®,@) = (du(x)® + u(x)(Eg'))g = (dWo(x. DG. Fg"'))g 


for all x € 20, X € R", g €G, and g € T,G, and the fact that the derivative dWo(x, 1) 
is bijective for all x € {2p (even for all x € £2). 

Thus we have proved that yo : 29 x G > P is an injective immersion. Shrinking 
§2o further, if necessary, we may assume that {29 has a compact closure and that y 
is injective on Qo x G. This implies that Wo is proper. Namely, if (x;, g;);en is a 
sequence in {29 x Gand (x, g) € 2 x Gsuch that Wo(x, g) = limj+oo Wo(%), gi), 
then there is a subsequence (x;,, g;,)ven that converges to a pair (x’, g’) € 29 x G. 
This subsequence satisfies 


wx’, g') = lim Yo(xi,. i.) = WO, 8). 


Since y is injective on 2o x G, this implies x = x’ and g = g’. Thus every subse- 
quence of (x;, g;)icN has a further subsequence that converges to (x, g) and so the 
sequence (x;, 2;)icN itself converges to (x, g). Thus the map Wo : 29 x G—> P is 
a proper injective immersion and this proves Step 1. 


Step 2 Lett: 82 — P bea local slice. Then the set U := m(t(82)) C B is open in 
the quotient topology and the map m 01: 82 — U isa homeomorphism with respect 
to the quotient topology on U. 


The map y : 2x G— P, defined by w(x, g) := U(x)g for x € 2 and g €G, 
is an embedding. Hence W := w(S2 x G) is an open G-invariant subset of P 
and w : 2 x G > W is a G-equivariant homeomorphism. Moreover, for every ele- 
ment p € P, we have z(p) € U if and only if there exists an element x € §2 and an 
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element g € G such that p = 1(x)g = W(x, g). Thus m7'(U) = W(Q x G) = W 
is an open subset of P, and so U is an open subset of B = P/G with re- 
spect to the quotient topology. The continuity of z o1: 2 — U follows directly 
from the definition. Moreover, if 2’ C 2 is an open set and U' := m(u(22’)), 
then z~!(U’) = w(Q’ x G) is open by the same argument, and so U’ C B is open 
with respect to the quotient topology. Thus 7 01: 2 — U is a homeomorphism 
and this proves Step 2. 


Step 3 By Step 1 there exists a collection ty : Qa > P, a € A, of local slices such 
that the sets Uy := 1(ta(_)) cover the orbit space B = P/G. Fora € A define 


by := (1 or ie 2Uy > Qq. 


Then A = {(¢u, Uw)}aea is a smooth structure on B which renders the canonical 
projection m : P — B into a submersion. Moreover, this smooth structure is com- 
patible with the quotient topology on B. 


For a, 6 € A define Qeg := da(Uy M Up) and dpa := op 0 by! : Rap > Lga- 
We must prove that gq is smooth. To see this, define Wy : 2g x G > P by 
Wa(X, g) := la(x)g fora € A, x € Qy, and g € G. Then We is a diffeomorphism 
onto its image and Wy (S2ag x G) = Wp(2ba x G) = ma! (Uy Ug). For x € Qop the 
element g(x) € Qgq is the projection of We © Wa (x, 1) onto the first factor. Thus 
ga is smooth and so is its inverse ¢yg. This shows that {(Uy, da)}aea iS a smooth 
structure on B. Second, z is a submersion with respect to this smooth structure, 
because dy 0 1 0 Wy(x,g) = x foralla € A, all x € Qy, and all g € G. Third, this 
smooth structure is compatible with the quotient topology by Step 2. This proves 
Step 3. 


Step 4 There is only one smooth structure on B with respect to which the projec- 
tion x : P — B is a submersion. 


Fix any smooth structure on B for which the projection z : P — B is asubmersion. 
Then the dimension of B is n = dim(P) — dim(G), and so the smooth structure 
consists of bijections dy : Uy > 2, from subsets Uy C B onto open sets 2y C R” 
such that the sets U, cover B and the transition maps are diffeomorphisms between 
open subsets of R”. 

We prove that the intrinsic topology on B agrees with the quotient topology. To 
see this, fix a subset U C B. Then the following are equivalent. 


(a) U is open with respect to the intrinsic topology on B. 
(b) dg(U N U,) is open in R" for alla € A. 

(c) m~!(U MN Uy) is open in P for alla € A. 

(d) z~!(U) is open in P. 

(e) U is open with respect to the quotient topology on B. 
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The equivalence of (a) and (b) follows from the definition of the intrinsic topology. 
That (b) implies (c) follows from the three observations that the set a! (Uy) is open 
in P, the map ¢,07 :2~!(Uy) > Q, is continuous, and (¢y 07)7!(¢g(U NUg)) = 
a '(U M U,). That (c) implies (b) follows from the fact that the map ¢y oz : 
a '(Uy) — 94 is a submersion and hence maps the open set 2~!(U M U,) onto an 
open subset of 2, (Corollary 2.6.2). The equivalence of (c) and (d) follows from 
the fact that the map a : P — B is continuous and U, C B is open (both with 
respect to the intrinsic topology on B) and so 2~!(U,) is open in P for alla € A. 
The equivalence of (d) and (e) follows from the definition of the quotient topology 
on B. 

Now let «: 2 — P bea local slice and define the set U := m(u(2)) C B and 
the map ¢ := (a 01)7! : U > Q. Then the composition 


boo | = b400 01: 6(U NUg) > ba(U N Ug) 


is a homeomorphism between open subsets of R”. Moreover, ¢, 07! is the 
composition of the smooth maps c: {x € 2|(i(x)) € Uy} > aw 1(U NU,), 
x:m (UNUy) > UNUg, and dy: U NUy > bg(U N Uy). So by ¢7! is 
smooth and its derivative is everywhere bijective because mz is a submersion and 
the kernel of dz(i(x)) is transverse to the image of di(x). Thus 0 ¢7! is a 
diffeomorphism by the Inverse Function Theorem and this proves Step 4. 


Step 5 The quotient topology on B is a Hausdorff and second countable. 


Let ty : 2, > P fora € A be acollection of local slices such that the sets Uy, := 
I(ty(Qy)) cover B. Then the open sets 2~'(U,) form an open cover of P and 
so there is a countable subcover. Thus B is second countable. To prove that B 
is Hausdorff, fix two distinct elements bp,b; € B and choose po, py € P such 
that (po) = bo and z(p,) = b,. Then poG and p;G are disjoint compact subsets 
of P and hence can be separated by disjoint open subsets Up, U; C P, because P 
is a Hausdorff space. Now for i = 0,1 the set Vj := {p € P| pG C U;} is open 
(exercise) and contains the orbit p;G. Hence Wy := m(Vo) and W, := 2(V,) are 
disjoint open subsets of B such that by) € Wo and b, € W,. This proves Step 5 and 
Theorem 2.9.14. Oo 


Example 2.9.15 There are many important examples of free group actions 
and principal bundles. A class of examples arises from orthonormal frame bun- 
dles (Sect. 3.4). The complex projective space B = CP” arises from the action of 
the circle G = S! on the unit sphere P = S?”+! c C”+! (Example 2.8.5). The 
real projective space B = RP” arises from the action of the finite group G = Z/2Z 
on the unit sphere P = S” C R"*t! (Example 2.8.6). The complex Grassman- 
nian B = G;(C") arises from the action of G = U(k) on the space P = F,(C") 
of unitary k-frames in C” (Example 3.7.6). If G is a Lie group and K C G is 
a compact subgroup, then by Theorem 2.9.14 the homogeneous space G/K 
admits a unique smooth structure such that the projection 2 :G— G/K is a 
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submersion. The example SL(2,C)/SU(2) can be identified with hyperbolic 3- 
space (Sect. 6.4.4), the example U(n)/O(n) can be identified with the space of 
Lagrangian subspaces of a symplectic vector space ([49, Lemma 2.3.2]), the ex- 
ample Sp(2n)/U(n) can be identified with Siegel upper half space or the space of 
compatible linear complex structures on a symplectic vector space (Exercise 6.5.24 
and [49, Lemma 2.5.12]), and the example G2/SO(4) can be identified with the 
associative Grassmannian ([68, Remark 8.4]). 


2.9.6 Standing Assumption 


We have seen that all the results in the present chapter carry over to the intrinsic 
setting, assuming that the topology of M is Hausdorff and paracompact. In fact, in 
many cases it is enough to assume the Hausdorff property. However, these results 
mainly deal with introducing the basic concepts such as smooth maps, embeddings, 
submersions, vector fields, flows, and verifying their elementary properties, i.e. with 
setting up the language for differential geometry and topology. When it comes to 
the substance of the subject we shall deal with Riemannian metrics and they only 
exist on paracompact Hausdorff manifolds. Another central ingredient in differen- 
tial topology is the theorem of Sard and that requires second countability. To quote 
Moe Hirsch [29]: “Manifolds that are not paracompact are amusing, but they never 
occur naturally and it is difficult to prove anything about them.” Thus we will set 
the following convention for the remaining chapters. 


We assume from now on that each intrinsic manifold M 
is Hausdorff and second countable and hence is also paracompact. 


For most of this text we will in fact continue to develop the theory for submanifolds 
of Euclidean space and indicate, wherever necessary, how to extend the definitions, 
theorems, and proofs to the intrinsic setting. 


® 


Check for 
updates 


The Levi-Civita Connection 


For a submanifold of Euclidean space the inner product on the ambient space de- 
termines an inner product on each tangent space, the first fundamental form. The 
second fundamental form is obtained by differentiating the map which assigns to 
each point in M C R” the orthogonal projection onto the tangent space (Sect. 3.1). 
The covariant derivative of a vector field along a curve is the orthogonal projection 
of the derivative in the ambient space onto the tangent space (Sect. 3.2). We will 
show how the covariant derivative gives rise to parallel transport (Sect. 3.3), ex- 
amine the frame bundle (Sect. 3.4), discuss motions without “sliding, twisting, and 
wobbling”, and prove the development theorem (Sect. 3.5). 

In Sect. 3.6 we will see that the covariant derivative is determined by the 
Christoffel symbols in local coordinates and thus carries over to the intrinsic setting. 
The intrinsic setting of Riemannian manifolds is explained in Sect. 3.7. The covari- 
ant derivative takes the form of a family of linear operators V : Vect(y) — Vect(y), 
one for every smooth curve y : J — M, and these operators are uniquely charac- 
terized by the axioms of Theorem 3.7.8. This family of linear operators is the 
Levi-Civita connection. 


3.1 Second Fundamental Form 


Let M Cc R” be a smooth m-manifold. Then each tangent space of M is an m- 
dimensional real vector space and hence is isomorphic to R’”. Thus any two tangent 
spaces T,M and T,M are of course isomorphic to each other. While there is no 
canonical isomorphism from 7, M to T,M we shall see that every smooth curve 
y in M connecting p to g induces an isomorphism between the tangent spaces via 
parallel transport of tangent vectors along y. 

Throughout we use the standard inner product on R” given by 


(v, w) = VW + VQW2 +++ + Uy Wy 
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for v = (vj,...,U,) € R" and w = (wj,..., Wy) € R”. The associated Euclidean 
norm will be denoted by 


l= Ve = Yui toh te $03 


for v = (v1,...,Un) € R”. When M C R” is a smooth m-dimensional submani- 
fold, a first observation is that each tangent space of M inherits an inner product 
from the ambient space R”. The resulting field of inner products is called the first 
fundamental form. 


Definition 3.1.1 Let M@ C R” be a smooth m-dimensional submanifold. The first 
fundamental form on M is the field which assigns to each p € M the bilinear map 


&p:1,M xT,M—>R 
defined by 
gp(v, w) = (v,w) (3.1.1) 
forv,w € T,M. 

A second observation is that the inner product on the ambient space also deter- 
mines an orthogonal projection of IR” onto the tangent space 7, M foreach p « M. 
This projection can be represented by the matrix J7(p) € R”*" which is uniquely 
determined by the conditions 

II(p) = II(p)’ = I1(p), (3.1.2) 
and 
II(p)v =v => veT,M (3.1.3) 
for p € M and v € R” (see Exercise 2.6.9). 
Lemma 3.1.2 The map IT : M > R"*" defined by (3.1.2) and (3.1.3) is smooth. 
Proof This follows directly from Theorem 2.6.10 and Corollary 2.6.12. More ex- 


plicitly, if U C M is an open set and ¢@ : U — 2 is acoordinate chart onto an open 
subset 2 C R” with the inverse y := ¢-' : 2 > U, then 


Mp) = dvolp) (AvOW"dVO)) Avo" 


for p € U and this proves Lemma 3.1.2. O 
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Fig. 3.1 A unit normal vector 
field 


Example 3.1.3 (Gau® map) Let M C R’”*! be a submanifold of codimension 
one. Then 7M + is a vector bundle of rank one (Corollary 2.6.13), and so each 
fiber 7, M + is spanned by a unit vector v(p) € R”, determined by T,M up to 
a sign. By Theorem 2.6.10 each po € M has an open neighborhood U Cc M on 
which there exists a smooth map 


viU > R™ 
satisfying 
v(p) LT,M, |v(p)| = 1 (3.1.4) 


for all p € U (see Fig. 3.1). Such a map v is called a Gau8 map. The function 
IT: M — R"*" is in this case given by 


T(p) = 1-v(p)vo(p)' (3.455) 


for p € U. 


Example 3.1.4 Let M = S? C R?. Then v(p) = p and so 


1—x? —xy —XxZ 
II(p)=1—pp'=| -yx 1l-y? -yz 
—zx —zy 1-2? 


for p = (x, y,z) € S?. 


Example 3.1.5 (M6bius strip) Consider the submanifold 


x = (1 + rcos(6@/2)) cos(9), 
M := 2 (x,y,z) € R® y = (1+ rcos(0/2)) sin(@), 
z=rsin(@/2),r,0 ER, |r| <é 


for ¢ > 0 sufficiently small. Show that there does not exist a global smooth function 
v:M — R3 satisfying (3.1.4). 


Example 3.1.6 Let U C R” be an open set and f : U > R”” be a smooth 
function such that 0 € R’~” is a regular value of f and UM = f7~1(0). Then 
T,M = kerdf(p) and 


II(p) = 1—df(p)' (df(p)df(p)') “ df(p) 
forevery p © UNM. 
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Example 3.1.7 Let 82 Cc R” be an open set and y : 82 — M beasmooth embed- 
ding. Then Ty x) M = imdy(x) and 


TW (x)) = dW(x)(dv)"dv)) dv)" 
for every x € Q. 


Next we differentiate the map I7 : M — R”*" in Lemma 3.1.2. The derivative 
at p € M takes the form of a linear map 


dII(p) : T,M + Ri 


which, as usual, is defined by 


d 
dIT(p)v := Ai IT(y(t)) € R"™ 
Eley 


t 
for v ¢ T,M, where y : R — M is chosen such that y(0) = p and y(0) = v (see 
Definition 2.2.13). We emphasize that the expression d/T(p)v is a matrix and can 
therefore be multiplied by a vector in R”. 
Lemma 3.1.8 For all p ¢ M andv,w € T,M we have 
(dIT(p)v) w = (dIT(p)w)v € T,M*. 


Proof Choose a smooth path y : R — M and a vector field X : R — R” along y 
such that 


yvO=p, yO)=v, X(0)=w. 
For example, we can choose X(t) := IT(y(t))w. Then 


X(t) = M(y(t)) X(t) 
for every ¢ € R. Differentiate this equation to obtain 
X(t) = My) XW) + AYO) 0) XO. (3.1.6) 
Hence 
(41(v)¥O) XH = (L-TYO))X) € TyM~> B17) 


for every t € R and, with t = 0, we obtain (d/T(p)v)w € T,M¢. 
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Now choose a smooth map 
R? > M :(s,t) & y(s,t) 
satisfying 
y(0,0) = p, 2.0.0) = 20.0) = w, 


(for example by doing this in local coordinates) and denote 


a a 
Xe H4= 5.0) ETysyM,  —-Y(s,t) = a 0) E Tye) M. 
Then 


oY = ay _ ax 
ads osdt Ot 


and hence, using (3.1.7), we obtain 


= (1- (y)) a 
= (I-10) 
- (anm2)y 
- (aa) ae 


With s = t = 0 we obtain 
(dII(p)w)v = (dIT(p)v)w € T,M* 


and this proves Lemma 3.1.8. Oo 


Definition 3.1.9 The collection of symmetric bilinear maps 
hyp: TyM X T,M > T,M-, 
defined by 
h,(v, w) := (dIT(p)v)w = (dIT(p)w)v (3.1.8) 


for p € M and v, w € T,M is called the second fundamental form on M. 
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Example 3.1.10 Let M Cc R”*! be an m-manifold and v : M > S” be a GauB 
map so that 7, M = v(p)* for every p € M (see Example 3.1.3). Then IT(p) = 
1—v(p)v(p)! and hence 


hp(v, w) = —v(p)(dv(p)v, w) 
for p ¢ M andv,w €T,M. 
Exercise 3.1.11 Choose a splitting R” = R” x R"”~” and write the elements of 
R” as tuples (x, y) = (X1,..-,Xm,V1,--+;¥n—-m) Let M C R"” be a smooth m- 
dimensional submanifold such that p = 0 € M and 


TyM = R™ x {03, Ty)M+ = {0}x R™™. 


By the implicit function theorem, there are open neighborhoods 92 Cc R” and 
V CR" of zero and a smooth map f : 2 — V such that 


M1 (82 x V) = graph(f) = {(x, f(x)) |x € 2}. 


Thus /(0) = 0 and df(0) = 0. Prove that the second fundamental form hp : 
T,M xT,M > T,M + is given by the second derivatives of /, i.e. 


2 


m a 7 
hp(v, w) = | 0, (0) v; w; 
4 Xu Ox; 0X; 7 


for v,w € 7,M = R”™ x {0}. 


Exercise 3.1.12 Let M C R” be an m-manifold. Fix a point p € M and a unit 
tangent vector v € 7,M so that |v| = 1 and define 


L:={pt+tu+w|teR, wl TM}. 


Let y : (—é,€) ~ MOL bea smooth curve such that y(0) = p, y(0) = v, and 
|y(t)| = 1 for all t. Prove that 


y(0) = hy(v, v). 


Draw a picture of M and L in the case n = 3 andm = 2. 
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3.2 Covariant Derivative 


Definition 3.2.1. Let 7 C R be an open interval and let y : J — M be a smooth 
curve. A vector field along y is a smooth map X : J + R” such that X(t) € T,~)M 
for every tf € I (see Fig. 3.2). The set of smooth vector fields along y is a real vector 
space and will be denoted by 


Vect(y) := {X : I > R"|X is smooth and X(t) € TywM Vt € I}. 


The first derivative X(t) of a vector field along y at ¢ € J will, in general, not 
be tangent to WM. We may decompose it as a sum of a tangent vector and a normal 
vector in the form 


X(t) = My) X@) + (l- Ty@)) XO, 


where JT : M — R”*” is defined by (3.1.2) and (3.1.3). The tangential component 
of this decomposition plays an important geometric role. It is called the covariant 
derivative of X at f. 


Definition 3.2.2 (Covariant derivative) Let J C R be an open interval, let 
y : I — M bea smooth curve, and let X € Vect(y). The covariant derivative of 
X is the vector field VX € Vect(y), defined by 


VX(t) := IT(y(t))X (t) € TyyM (3.2.1) 
fort eT. 


Lemma 3.2.3 (GauB-—Weingarten formula) The derivative of a vector field X 
along a curve y is given by 


X(t) = VX(t) + hy (VO, X(). (3.2.2) 


Here the first summand is tangent to M and the second summand is orthogonal to 
the tangent space of M at y(t). 


Proof This is equation (3.1.6) in the proof of Lemma 3.1.8. Oo 


It follows directly from the definition that the covariant derivative along a 
curve y: 1 > M is a linear operator V : Vect(y) — Vect(y). The following 
lemma summarizes the basic properties of this operator. 


Fig. 3.2 A vector field along 
a curve 
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Lemma 3.2.4 (Covariant derivative) The covariant derivative satisfies the fol- 
lowing axioms for any two open intervals I, J CR. 


(i) Let y:1— M bea smooth curve, let X : I > R be a smooth function, and 
let X € Vect(y). Then 


VAX) =AX +AVX. (3.2.3) 


(ii) Let y: I — M bea smooth curve, let o : J — I be a smooth function and 
let X € Vect(y). Then 


V(X og) =a(VX 90). (3.2.4) 
(ii) Let y: I — M be asmooth curve and let X,Y € Vect(y). Then 


d 


athe) = (VX.Y) + (X, VY). (3.2.5) 


(iv) Let y: I x J ~ M bea smooth map, denote by Vs the covariant derivative 
along the curve s +> y(s,t) (with t fixed), and denote by V, the covariant 
derivative along the curve t +> y(s,t) (with s fixed). Then 


V;01¥ = Vidsy. (3.2.6) 


Proof Part (i) ein from the Leibniz rule 4 (AX) = AX + AX and (ii) follows 
from the chain rule 4 G(X 00) = o(X oo). To prove part (iii), use the orthogonal 
projections /T(y(t)) : R” > T,~)M to obtain 


.Y)+ (X,Y) 
X,I(y)¥) + ((y) X.Y) 
II(y)X,Y) + (X,(y)Y) 
Vir) +i Vy). 


d 
<{X,¥) = 


(X 
= 
=a 
=< 


Part (iv) holds because the second derivatives commute. This completes the proof 
of Lemma 3.2.4. O 


Part (1) in Lemma 3.2.4 asserts that the operator V is what is called a connection, 
part (iii) asserts that it is compatible with the first fundamental form, and part (iv) 
asserts that it is torsion-free. Theorem 3.7.8 below asserts that these conditions (to- 
gether with an extended chain rule) determine the covariant derivative uniquely. 
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3.3 Parallel Transport 


Definition 3.3.1 (Parallel vector field) Let 7 C R be an interval and let y: 1 > M 
be a smooth curve. A vector field X along y is called parallel iff 


VX(t) =0 
for allt € J. 


Example 3.3.2. Assume m =n so that M C R” is an open set. Then a vector field 
along a smooth curve y : J — M isasmooth map X : J > R”. Its covariant deriva- 
tive is equal to the ordinary derivative V X(t) = X(t) and hence X is is parallel if 
and only if it is constant. 


Remark 3.3.3. For every X € Vect(y) and every t € J we have 
VX(t) =0 => X(t) L TyayM. 


In particular, y is a vector field along y and Vy(t) = IT(y(t))y(t). Hence y is a 
parallel vector field along y if and only if y(t) L T)~)M for all t €¢ J. We will 
return to this observation in Chap. 4. 


In general, a vector field X along a smooth curve y : J — M is parallel if and 
only if X(t) is orthogonal to T,;,M for every t and, by the Gau8—Weingarten 
formula (3.2.2), we have 


VX =0 = XS hye) 


The next theorem shows that any given tangent vector vg € T,7,)M extends uniquely 
to a parallel vector field along y. 


Theorem 3.3.4 (Existence and uniqueness) Let J C R be an interval and y : 
I > M bea smooth curve. Let to € I and vo € Tyt))M be given. Then there is a 
unique parallel vector field X € Vect(y) such that X (to) = vo. 


Proof Choose a basis e;,...,@€m of the tangent space 7, (,)M and let 
X1,...,Xm € Vect(y) 
be vector fields along y such that 
Xi (to) = &, a nner 18 


(For example choose X;(t) := IT(y(t))e;.) Then the vectors X;(to) are linearly 
independent. Since linear independence is an open condition there is a constant 
€ > 0 such that the vectors X\(¢),..., Xm(t) € Ty) M are linearly independent for 
every t € Ip := (t) — €,t9 + €) NT. Since T,q)M is an m-dimensional real vector 
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space this implies that the vectors X;(¢) form a basis of T,(1)M for every t € Ip. 
We express the vector VX; (t) € 7)()M in this basis and denote the coefficients by 
a‘ (t) so that 


VXi(t) = > ak @) Xz (0). 


k=1 


The resulting functions ak : Ig — R are smooth. Likewise, if X : J — R” is any 
vector field along y, then there are smooth functions &' : Ij > R such that 


X(t) = b> E(t) X;(t) for all t € Ip. 
i=l 
The derivative of X is given by 


XO) = V(HOXW + OX) 
i=l 
and the covariant derivative by 


m 


VX) = (FOX +8 OVX) 


i=1 


= FOX + EOD aOR) 
i=1 i=l k=1 


= »(F0 +> a OF 0) X,(t) 


k=1 i=1 


fort € Ip). Hence V X(t) = 0 if and only if 


| a) + anit 
E+ AEH =0, A=]: : 
an) + ami) 


Thus we have translated the equation VX = 0 over the interval Jp into a time de- 
pendent linear ordinary differential equation. By a theorem in Analysis II (see [64, 
Lemma 4.4.3]), this equation has a unique solution for any initial condition at 
any point in J. Thus we have proved that every fo € J is contained in an inter- 
val Io C J, open in the relative topology of 7, such that, for every t € Jo and ev- 
ery v; € T,,)M, there exists a unique parallel vector field X : Jp > R” along y|,, 
satisfying X(t,) = v1. We formulate this condition on the interval Jp as a logical 
formula: 


VtEelo Vue Ty4)M AX e Vect(y|1,) 
such that VX = 0 and X(t;) = v,. (3.3.1) 
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If two J-open intervals Jo, J; C J satisfy this condition and have nonempty inter- 
section, then their union Jp U J; also satisfies (3.3.1). (Prove this!) Now define 


J2= | }{7o C I | Jp is an J-open interval, Jp satisfies (3.3.1), tf) € Io}. 


This interval J satisfies (3.3.1). Moreover, it is nonempty and, by definition, it is 
open in the relative topology of J. We prove that it is also closed in the relative 
topology of 7. Thus let (f;);e~ be a sequence in J converging to a point t* € /. 
By what we have proved above, there exists a constant ¢ > 0 such that the in- 
terval [* := (t* —e,t* +) MT satisfies (3.3.1). Since the sequence (f;);c~N con- 
verges to ¢*, there exists ani € N such that 7; € 7*. Since ¢; € J there exists an 
interval Jo C 7, open in the relative topology of 7, that contains fo and f; and 
satisfies (3.3.1). Hence the interval Jy) U J* is open in the relative topology of J, 
contains fy and f*, and satisfies (3.3.1). This shows that t* € J. Thus we have proved 
that the interval J is nonempty, and open and closed in the relative topology of J. 
Hence J = J and this proves Theorem 3.3.4. 


Definition 3.3.5 (Parallel transport) Let 7 C R bean interval and let y : ] — M 
be a smooth curve. For fo,¢ € J we define the map 


P,(t,t0)  Ty()M > TyyM 


by ®,(t,to)vo := X(t) where X € Vect(y) is the unique parallel vector field 
along y satisfying X(to) = vo. The collection of maps ®,(¢, to) for t,t € J is 
called parallel transport along y. 


Recall the notation 
y*TM = {(s,v)|s €1, v € Ty) M} 


for the pullback tangent bundle. This set is a smooth submanifold of J x R”. (See 
Theorem 2.6.10 and Corollary 2.6.13.) The next theorem summarizes the proper- 
ties of parallel transport. In particular, the last assertion shows that the covariant 
derivative can be recovered from the parallel transport maps. 


Theorem 3.3.6 (Parallel transport) Let y : 1 — M be a smooth curve on an 
interval I C R. 


(i) The map ®,(t,s) + T,s)M — Tyqy)M is linear for all s,t € I. 
(ii) Forallr,s,t € I we have 


®,(t,5)° Py (s,r) = Py (t,r), ®, (t,t) = id. 
(iii) For alls,t € I andallv,w € T,s)M we have 
(®,(t,8)v, ®, (t,s)w) = (v, w). 


Thus ®,(t,8) : Ty(s)M — T,q)M is an orthogonal transformation. 
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(iv) If J C Ris an interval ando : J — I is a smooth map, then 
Py oo (t, 8) = By (a(t), a(s)). 


foralls,té J. 
(v) The map 


Ixy*TM > y*TM : (t,(s,v)) & (t, ® (t, 5)v) 


is smooth. 
(vi) For all X € Vect(y) and t,t) € I we have 


© by t)X(t) = By (to, 1)V X(t). 


Proof Assertion (i) holds because the sum of two parallel vector fields along y is 
again parallel and the product of a parallel vector field with a constant real number 
is again parallel. Assertion (11) follows directly from the uniqueness statement in 
Theorem 3.3.4. 

We prove (iii). Fix a number s € J and two tangent vectors 


v,w € Tys)M. 
Define the vector fields X, Y € Vect(y) along y by 
X(t) := ®,(t, s)v, Y(t) := Oy (t,s)w. 
These vector fields are parallel. Thus, by equation (3.2.5) in Lemma 3.2.4, we have 


f(x.) = (VX,Y) + (X, VY) =0. 


Hence the function J +> R:t +> (X(t), Y(t)) is constant and this proves (iii). 
We prove (iv). Fix an element s € J and a tangent vector v € Ty(o(s))M. Define 
the vector field X along y by 


X(t) := @,(t,0(s))v 


fort € J. Thus X is the unique parallel vector field along y that satisfies 


X(o(s)) = v. 
Denote 


V:=yoo:J—>M, X¥ := Xoo:1 >R" 
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Then X is a vector field along ¥ and, by the chain rule, we have 


d— d . : 
ZX) = EXO) = 6OX(o(t)). 


Projecting orthogonally onto the tangent space T)(.(;))M we obtain 
VX (t) = a(t)VX(a(t)) = 0 


for every t € J. Hence X is the unique parallel vector field along 7 that satisfies 
X(s) = v. Thus 


P(t, 8)v = X(t) = X(o(t)) = (a(t), 0(s))v. 


This proves (iv). 

We prove (v). Fix a point fo € 7, choose an orthonormal basis e1,...,e@, of 
Ty()M, and define X;(t) := ®,(¢,to)e; fort ¢ J andi = 1,...,m. Thus X; € 
Vect(y) is the unique parallel vector field along y such that X; (f)) = e;. Then by (iii) 
we have 


(X;(t), X;(¢)) = 6 


for alli, 7 € {1,...,m} and allt € J. Hence the vectors X,(t),..., X(t) form an 
orthonormal basis of T,;)M for every t ¢ J. This implies that, for each s € J and 
each tangent vector v € Ts) M, we have 


m 


v= Y(Xi(s), v) X;(s). 


i=l 
Since each vector field X; is parallel it satisfies X;(¢) = ®,(t, s)X;(s). Hence 


m 


®,(t,s)v = )-(Xi(s), v) Xi (0) (3.3.2) 


i=] 


for all s,¢ € J and v € T,,s)M. This proves (v). 
We prove (vi). Let X,..., Xm € Vect(y) be as in the proof of (v). Thus every 
vector field X along y can be written in the form 


XO=VEOKO, FO := (Xi, X(0). 


i=l 
Since the vector fields X; are parallel we have 


m 


VX(t) = DoE (OX 


i=1 
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for all t € J. Hence 


m 


Py (t,t) X(t) = OE (OH)XiGo), — Py, NV X() = DE (Xi). 


t=] i=l 


Evidently, the derivative of the first sum with respect to ¢ is equal to the second sum. 
This proves (vi) and Theorem 3.3.6. Oo 


Remark 3.3.7 For s,¢ € J we can think of the linear map 
&,(t,s)IT(y(s)) : R" > TyyM CR" 


as areal n xn matrix. The formula (3.3.2) in the proof of (v) shows that this matrix 
can be expressed in the form 


P(t, s)IT(y(s)) = Y> Xi (1) Xi(s)" eR", 


i=l 


The right hand side defines a smooth matrix valued function on J x J and this is 
equivalent to the assertion in (Vv). 


Remark 3.3.8 It follows from assertions (ii) and (iii) in Theorem 3.3.6 that 
®,(t,s) | = ,(s,t) = ,(t,5)* 


for all s,t ¢ J. Here the linear map ®, (t, s)* : Tq) M — T,,s)M is understood as the 
adjoint operator of ®,,(t, 8) : Ty,s)M — T,~)M with respect to the inner products on 
the two subspaces of IR” inherited from the Euclidean inner product on the ambient 
space. 


The two theorems in this section carry over verbatim to any smooth vector bundle 
E Cc M x R" over a manifold. As in the case of the tangent bundle one can define 
the covariant derivative of a section of FE along y as the orthogonal projection of 
the ordinary derivative in the ambient space IR” onto the fiber E,). Instead of 
parallel vector fields one then speaks about horizontal sections and one proves as 
in Theorem 3.3.4 that there is a unique horizontal section along y through any point 
in any of the fibers F,,(,,.). This gives parallel transport maps from E,,s) to Ey,r) 
for any pair s,t € J and Theorem 3.3.6 carries over verbatim to all vector bundles 
E CM xR". We spell this out in more detail in the case where E= TM+ C M xR" 
is the normal bundle of M. 

Let y : 1 — M beasmooth curve. A normal vector field along y is a smooth 
map Y : J > R" such that Y(t) L T,()M for every t ¢ J. The set of normal vector 
fields along y will be denoted by 


Vect*(y) := {Y : I > R"|Y is smooth and Y(t) L T,q)M for allt € I}. 
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This is again a real vector space. The covariant derivative of a normal vector field 
Y € Vect!(y) at t € J is defined as the orthogonal projection of the ordinary deriva- 
tive onto the orthogonal complement of 7,()M and will be denoted by 


V+Y() := (l—M(y@))¥(@. (3.3.3) 
Thus the covariant derivative defines a linear operator 
V+: Vectt(y) > Vectt(y). 


There is a version of the Gau8—Weingarten formula for the covariant derivative of a 
normal vector field. This is the content of the next lemma. 


Lemma 3.3.9 Let M C R" be a smooth m-manifold. For p € M andu € T,M 
define the linear map hp(u) : T,>M > T,M+ by 


h,(u)v := h, (u,v) = (dIT(p)u)v (3.3.4) 
for v € T,M. Then the following holds. 
(i) The adjoint operator hp(u)* : T,M+ — T,M is given by 
h,(u)*w = (dIT(p)u)w, we T,M-. (3.3.5) 


(i) JfI C Ris an interval, y: I > M isasmooth curve, and Y € Vect"(y), then 
the derivative of Y satisfies the GauS—Weingarten formula 


Yt) = VY) — hyn Gy" YO). (3.3.6) 


Proof Since IT(p) € R”*" is a symmetric matrix for every p € M so is the matrix 
dII(p)u for every p € M and every u € T,M. Hence 


(v, hp (u)*w) = (hp(u)v, w) 
= ((dIT(p)u)v, w) 
= (v, (d/T(p)u)w) 


for every v € T,M and every w € T,M +. This proves (i). 
To prove (ii) we observe that, for Y € Vect+(y) andt € I, we have 


IT(y(t))¥(t) = 0. 


Differentiating this identity we obtain 


My) ¥ (t) + (dy@)yO)¥) = 0 
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and hence 


YO =Y¥O)-AyO)YO- (€1yM)yO)YO 
= V+Y(t) —hyo (yt) Y@) 


for t € I. Here the last equation follows from (i) and the definition of V+. This 
proves Lemma 3.3.9. O 


Theorem 3.3.4 and its proof carry over to the normal bundle TM+. Thus, if 
y : I + M isasmooth curve, then for all s €¢ J and w € T,(s)M + there is a unique 
normal vector field Y € Vect(y) such that 


VY=0, Y@)=w. 
This gives rise to parallel transport maps 
By (t,8) : Tyg) M~ > TyyM~ 
defined by 
®;(t,s)w := Y(t) 
for s,t € J andw € T,y)M +, where Y is the unique normal vector field along y 
satisfying V+ Y =0 and Y(s) = w. These parallel transport maps satisfy exactly the 


same conditions that have been spelled out in Theorem 3.3.6 for the tangent bundle 
and the proof carries over verbatim to the present setting. 


3.4 The Frame Bundle 


Each tangent space of an m-manifold M is isomorphic to the Euclidean space R’”, 
however, in general there is no canonical isomorphism. The space of all pairs con- 
sisting of a point p in the manifold M and an isomorphism from R” to the tangent 
space of M at p is itself a smooth manifold, called the frame bundle of M. 


3.4.1 Frames of a Vector Space 


Let V be an m-dimensional real vector space. A frame of V is a basis €1,..., @ of 
V. It determines a vector space isomorphism e : R™ — V via 


m 


e€:= ) fe, E=(€',...,e") ER”. 
Z=1 
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Conversely, each isomorphism e : R” — V determines a basis €),...,@ of V via 
e; =e(0,...,0,1,0...,0), where the coordinate 1 appears in the ith place. The set 
of vector space isomorphisms from R” to V will be denoted by 


Liso(R™, V) := {e : R” > V |e isa vector space isomorphism}. 


The general linear group GL(m) = GL(m, R) (of nonsingular real m x m-matrices) 
acts on this space by composition on the right via 


GL(m) X Liso(R”, V) > Liso IR”, VV): (a,e) B a*e =e 0a. 
This is a contravariant group action in that 
a*b*e = (ba)*e, l*e=e 


for a,b € GL(m) and e € £;,,.(R’, V). Moreover, the action is free, i.e. for all 
a € GL(m) ande € £;,,(R’”, V), we have 


a*e=e —> a=l1. 
It is transitive in that for all e, e’ € Lig, IR”, V) there is a group element a € GL(m) 
such that e’ = a*e. Thus we can identify the space Li,.(R”, V) with the group 
GL(m) via the bijection 


GL(m) = Liso(R”, V):arh a*eo 


induced by a fixed element eg € £;,.(R”, V). This identification is not canonical; 
it depends on the choice of é9. The space £;,,(R”, V) admits a bijection to a group 
but is not itself a group. 


3.4.2 The Frame Bundle 


Definition 3.4.1 (Frame bundle) Let M Cc R” be a smooth m-manifold. The 
frame bundle of / is the set 


F(M) = {(p.e)|p € M,e © F(M)p}, (3.4.1) 
where F (M), is the space of frames of the tangent space at p, i.e. 
F(M), := Ligo(R”". TM). 
Define a right action of GL(m) on ¥ (M) by 
a*(p,e) := (p,a*e) = (p,eoa) (3.4.2) 


for a € GL(m) and (p,e) € F(M). 
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One can think of a frame e € Lis.(R”, T,M) as a linear map from R’” to R” 
whose image is 7, M and hence as an n x m-matrix of rank m. The basis of T, M 
associated to this frame is given by the columns of the matrix e € R”*’”". Thus the 
frame bundle ¥ (M) of an embedded manifold M C R” is a subset of the Euclidean 
space R” x RX" 


Lemma 3.4.2 The frame bundle 
F(M) Cc R" x R™ 
is a smooth manifold of dimension m + m?, the group action 
GL(m) x F(M) > F(M) : (a, (p.e)) > a*(p,e) 
is smooth, and the projection 
a: F(M)>M 


defined by n(p,e) := p for (p,e) € F(M) is a surjective submersion. The orbits 
of the GL(m)-action on F(M) are the fibers of this projection, i.e. 


GL(m)*(p,e) = x" '(p) = F(M), 


for (p,e) € F(M), and the group GL(m) acts freely and transitively on each of 
these fibers. 


Proof Let U C M be an M-open set. A moving frame over U is a sequence 
of m smooth vector fields F,,..., Ey, € Vect(U) on U such that the vectors 
E\(p),..-; E,,(p) form a basis of TM for each p € U. Any such moving frame 
gives a bijection 


U x GL(m) > FU) : (p,a) > a*(p, E(p)) = (p, E(p) 04), 


where 


E(p) := (E\(p),-.-, Em(p)) € F(M)p 


for p € U. This bijection (when composed with a parametrization of U) gives a 
parametrization of the open set F(U) in F(M). The assertions of the lemma then 
follow from the fact that the diagram 


U x GL(m) FU) 
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commutes. More precisely, suppose that there exists a coordinate chart 
g¢:U>2 
with values in an open set S2 C R”, and denote its inverse by 
wi= ol: 20. 
Then the open set 
F(U) = (VU) = {(p,e) € F(M) | p € U} = (U x R™”) N F(M) 

is parametrized by the map 

Q x GL(m) > FU) : (x, a) + (W(x), d(x) oa). 
This map is amooth and so is its inverse 

F(U) > @ x GL(m) : (p,e) > (¢(p).d4(p) 08). 


These are the desired coordinate chart on F(M). Thus ¥ (M) is a smooth manifold 
of dimension m + m*. Moreover, in these coordinates the projection x : F(U) > U 
is the map 2 x GL(m) > 22: (x, a) # x and so z is a submersion. The remaining 
assertions follow directly from the definitions and this proves Lemma 3.4.2. O 


The frame bundle F(M) is a principal bundle over M with structure group 
GL(m). More generally, a principal bundle over a manifold B with structure group 
G is a smooth manifold P equipped with a surjective submersion z : P — Banda 
smooth contravariant action 


Gx P—>P:(g,p) pg 


by a Lie group G such that z(pg) = z(p) for all p € P and g € Gand such that 
the group G acts freely and transitively on the fiber P, := 2~!(b) for each b € B. 
In this book we shall mostly be concerned with the frame bundle of a manifold M 
and the orthonormal frame bundle. 


Definition 3.4.3 The orthonormal frame bundle of / is the set 
O(M) := {(p,e) € R" x R"*” | p € M, ime = T,M, e'e = 1,3. 


If we denote by e; := e(0,...,0,1,0,...,0) (with 1 as the ith argument) the basis 
of T,M induced by the isomorphism e : R” — T,M, then we have 


€1,...,@m 18 an 


Acai 24 €3,€;) = bi i 
(e;,e;) Wy orthonormal basis. 


Thus O(M) is the bundle of orthonormal frames of the tangent spaces 7, M or the 
bundle of orthogonal isomorphisms e : IR” — 7, M. It is a principal bundle over M 
with structure group O(m). 
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Exercise 3.4.4 Prove that O(M) is a submanifold of F(M) and that the obvious 
projection z : O(M) — M is a submersion. Prove that the action of GL(m) on 
F(M) restricts to an action of the orthogonal group O(m) on O(M) whose orbits 
are the fibers 


O(M), := {e € R”™” | (p,e) € O(M)} 
= {e € £L,.(R”,T,M) |e'e = 1}. 
Hint: If ¢ : U > 22 is acoordinate chart on M with inverse y : 2 — U, then 
ex = d(x) (dw (xdwQy? : R" > TyoyM 


is an orthonormal frame of the tangent space Ty(.)M for every x € Q. 


3.4.3 Horizontal Lifts 


We have seen in Lemma 3.4.2 that the frame bundle # (M) is a smooth submanifold 
of R” x R”*’". Next we examine the tangent space of #(M) at a point (p,e) € 
F(M). By Definition 2.2.1, this tangent space is given by 


R—> F(M):t (v0), e@)) 
is a smooth curve satisfying 
y(0) = p and e(0) =e 


T(p.e)F (M) = 4 (7(0), €(0)) 


The next lemma gives an explicit formula for this tangent space in terms of the sec- 
ond fundamental form h, : T,M x T,M — T,M + in Definition 3.1.9. Compare 
this formula with Lemma 4.3.1 in the next chapter. 


Lemma 3.4.5 Let M C R"” be a smooth m-dimensional submanifold. Then the 
tangent space of # (M) at (p,e) is given by 


Pp €T,M,@ € R”™”", and 


(1 — 11(p))@ = hp Be ee, 


Tp.) F(M) = Jag 


Proof We prove the inclusion “C” in (3.4.3). Let (D,@) € Tipe) F (M) and choose 
a smooth curve R > #(M): tb (y(t), e(f)) such that 
y(0) = p, e(0) =e, y(0) = 3, é(0) =e. 
Fix a vector & € R” and define the vector field X € Vect(y) by X(t) := e(t)é 
for t € R. Then the Gau8—Weingarten formula (3.2.2) asserts that 
é(té = X(t) 
= VX) + hyo. XO) 
= My )eME + hyn VO), eE) 


3.4 The Frame Bundle 137 
for allt € R. Take t = 0 to obtain 


(1 — IT (p)je& = hy (DP. e&) = hp (peg 


for all € € R”. This proves the inclusion “C” in (3.4.3). Equality holds because both 
sides of the equation are (m + m7)-dimensional linear subspaces of R” x R"*”". 
This proves Lemma 3.4.5. Oo 


It is convenient to consider two kinds of curves in F(M), namely vertical curves 
with constant projections to M and horizontal lifts of curves in M. We denote by 
L£(R”, T,M) the space of linear maps from R” to T,M. 


Definition 3.4.6 (Horizontal lift) Let y : R — M bea smooth curve. A smooth 
curve 8 : R > F(M) is called a lift of y iff 


wop=y. 
Any such lift has the form 6(t) = (y(t), e(¢)) with e(¢) € Liso(R”, T,)M). The 
associated curve of frames e(t) of the tangent spaces T,;)M is called a moving 
frame along y. A curve 
B(t) = (v(@),e(@)) € F(M) 
is called horizontal or a horizontal lift of y iff the vector field 
X(t) = e(té 
along y is parallel for every € € R”. Thus a horizontal lift of y has the form 
B(t) = (V(t), ®, @, O)e) (3.4.4) 
for some e € Lig.(R”, To) M). 
Lemma 3.4.7 
(i) The tangent space of F (M) at (p,e) € F(M) is the direct sum 
T(p.e)F (M) = Ape) ® Vine) 
of the horizontal space 
Hope) = {(v, hp(v)e) |v € T,M} (3.4.5) 
and the vertical space 


Vip) = {0} x £(R”, TM). (3.4.6) 
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(ii) The vertical space Vip.) at (p,e) € F(M) is the kernel of the linear map 
dm(p,e): Tipe) F(M) > T,M. 

(iii) A curve B: IR > F(M) is horizontal if and only if it is tangent to the horizontal 
spaces, i.e. B(t) € Hpq for every t € R. 

(iv) IfB : R > F(M) is a horizontal curve, so is a* B for every a € GL(m). 

Proof The proof has four steps. 

Step 1 Let (p,e) € F(M). Then Vipe) = ker da(p,e) C Tipe) F (M). 

Since z is a submersion, the fiber 2~!(p) is a submanifold of ~(M) by Theo- 


rem 2.2.19 and T,p,)1~'(p) = ker dx(p, e). Now let (p,@) € ker dx(p,e). Then 
there exists a vertical curve 8 : R > ¥(M) with z o B = p such that 


B(0) =(p.e),  B(0) = (2). 


Any such curve has the form A(t) := (p,e(t)) where e(t) € Li.(R”,7,M). 
Hence p = 0 and @ = é(0) € £(R”, T, M). This shows that 


ker d(p,e) C Viney. (3.4.7) 
Conversely, for every @ € £(R”, 7, M), the curve 
R — £(R",T,M):tre(t):=e+fte 


takes values in the open set Lis.(R”,7,M) for ¢ sufficiently small and hence 
B(t) := (p, e(t)) is a vertical curve with B(0) = (0,@). Thus 


Vip,e) C ker dr(p,e) C Tipe) F(M). (3.4.8) 
Combining (3.4.7) and (3.4.8) we obtain Step | and part (11). 


Step 2 Let (p,e) € F(M). Then Hip.) C Tipe) F (M ). Moreover, every horizontal 
curve B : RR > F(M) satisfies B(t) € Hg for allt € R. 


Fix a tangent vector v € 7,M, let y : R > M be a smooth curve satisfy- 
ing y(0) = p and (0) = v, and let B : R > F(M) be the horizontal lift of y 
with 6(0) = (p,e). Then 

BO) =(V).e@), —e(t) = G (t, Oe. 


Fix a vector € € R” and consider the vector field 


X(t) := e(t)§ = %,(t, O)eE 
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along y. This vector field is parallel and hence, by the Gau8—Weingarten formula, 
it satisfies 


é(O)E = X(0) = hyo) (0), X(0)) = hp(v)eé. 
Here we have used (3.3.4). Thus 
(v, hp(v)e) = (70), €(0)) = BO) € Tg) F(M) = Tipe) F(M) 


and so Hip.) C Tp.e) F (M). Moreover, B(0) = (v,h,(v)e) € Hipe) = Hg) and 
this proves Step 2. 


Step 3 We prove part (i). 


We have Vip.2) C Tp.) F (M) by Step 1 and Hip.) C Tip.) F (M) by Step 2. More- 
over Hip ey A Vine) = {0} and so Tipe) F(M) = Hype) ® Vip) for dimensional 
reasons. This proves Step 3. 


Step 4 We prove parts (iii) and (iv). 


By Step 2 every horizontal curve B : R > F(M) satisfies B(t) € Agi). Con- 
versely, let R—> F(M):t+> B(t) = (yd), e(t)) be a smooth curve satisfy- 
ing B(t) € Hgq) for all ¢. Then é(t) = h,w(y(t))e(t) for all ¢. By the Gaub- 
Weingarten formula (3.2.2) this implies that the vector field X(t) = e(t)& along y 
is parallel for every — € R”, so B is horizontal. This proves part (iii). Part (iv) 
follows from (iii) and the fact that the horizontal tangent bundle H Cc T¥(M) is 
invariant under the induced action of the group GL(m) on T ¥(M). This proves 
Lemma 3.4.7. O 


The reason for the terminology introduced in Definition 3.4.6 is that one draws 
the extremely crude picture of the frame bundle displayed in Fig. 3.3. One thinks of 
F(M) as “lying over” M. One would then represent the equation y = 0 B by the 
following commutative diagram: 


F(M) ; 
wa 
R a M 


hence the word “lift”. The vertical space is tangent to the vertical line in Fig. 3.3 
while the horizontal space is transverse to the vertical space. This crude imagery 
can be extremely helpful. 


Exercise 3.4.8 The group GL(m) acts on F(M) by diffeomorphisms. Thus for 
each a € GL(m) the map 


F(M) > F(M) : (p,e) > a*(p,e) = (p,e 0a) 
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Fig. 3.3 The frame bundle F(M)p= x (p) 


| 


FM) 


(p.e) 


is a diffeomorphism of F(M). The derivative of this diffeomorphism is a diffeo- 
morphism of the tangent bundle T #(M) and this is called the induced action of 
GL(m) on TF (M). Prove that the horizontal and vertical subbundles are invariant 
under the induced action of GL(m) on TF(M). 


Exercise 3.4.9 Prove that Hp.) C T(p.)O(M) and that 


Tp.) O(M) = Hipe) @ Viney» — Vine) = Viney O Mp.) O(M) 


/ . 
pe) * 
for every (p,e) € O(M). 

The following definition introduces an important class of vector fields on the 


frame bundle that will play a central role in Sect. 3.5. They will be used to prove 
the Development Theorem 3.5.21 in Sect. 3.5.4 below. 


Definition 3.4.10 (Basic vector field) Every vector § € R” determines a vector 
field Bg € Vect(F (M)) defined by 


Be(p,e) := (e&, hp (e&)e) (3.4.9) 


for (p,e) € F(M). This vector field is horizontal, 1.e. 


Bz (p,e) € Ap); 


and projects to e€, i.e. 


dm(p,e)Be(p,e) = e& 


for all (p,e) € F(M). These two conditions determine the vector field Bg uniquely. 
It is called the basic vector field corresponding to &. 


Exercise 3.4.11 

(i) Prove that every basic vector field Bg € Vect(F (M)) is tangent to the orthonor- 
mal frame bundle O(M). 

(ii) Let RR F(M):t + (y(t), e()) be an integral curve of the vector field Bg 
and a € GL(m). Prove that R > F(M) : th a* B(t) = (y(t), a*e(t)) is an 
integral curve of B,-1¢. 
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(iii) Prove that the vector field Bg € Vect(F(M)) is complete for all € € R” if and 
only if the restricted vector field Bg|ocm) € Vect(O(M )) on the orthonormal 
frame bundle is complete for all € € R’”. 


Definition 3.4.12 (Complete manifold) A smoth m-manifold M Cc R” is 
called complete iff, for every smooth curve € : R — R” and every element 
(po,e0) € F(M), there exists a smooth curve B : R — F(M) such that 


B(0) = (po. eo) and B(t) = Bec(B(t)) for all t € R. 


3.5 Motions and Developments 


Our aim in this sections is to define motion without sliding, twisting, or wobbling. 
This is the motion that results when a heavy object is rolled, with a minimum of 
friction, along the floor. It is also the motion of the large snowball a child creates as 
it rolls it into the bottom part of a snowman. 

We shall eventually justify mathematically the physical intuition that either of 
the curves of contact in such ideal rolling may be specified arbitrarily; the other is 
then determined uniquely. Thus for example the heavy object may be rolled along 
an arbitrary curve on the floor; if that curve is marked in wet ink, another curve will 
be traced in the object. Conversely, if a curve is marked in wet ink on the object, the 
object may be rolled so as to trace a curve on the floor. However, if both curves are 
prescribed, it will be necessary to slide the object as it is being rolled, if one wants 
to keep the curves in contact. 

We assume throughout this section that M and M’ are two m-dimensional sub- 
manifolds of R”. Objects on M’ will be denoted by the same letter as the cor- 
responding objects on M with primes affixed. Thus for example /7’(p’) € R”’*”" 
denotes the orthogonal projection of IR” onto the tangent space T,,M’, V’ denotes 
the covariant derivative of a vector field along a curve in M’, and ae denotes par- 
allel transport along a curve in M’. 


3.5.1 Motion 


Definition 3.5.1 A motion of M along M’ (on an interval / C R) is a triple 
(YW, y, y’) of smooth maps 


W:I > O(n), y:Il>M, y':I > M' 
such that 
W(t)Tya)M = TyyM' Vitel. 
Note that a motion also matches normal vectors, i.e. 


W(t)TygyM+t =TywM'~ Vel. 
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Remark 3.5.2. Associated to a motion (Y% y, y’) of M along M’ is a family of 
(affine) isometries y; : IR” — R” defined by 


Wilp) = y(t) + ¥(O)(p — vy) (3.5.1) 
fort € J and p € R”. These isometries satisfy 


Wily) = y'(0), dWi(y())Ty)M = Ty M’ Vired, 


Remark 3.5.3 There are three operations on motions. 


Reparametrization If (WY, y, y’) is a motion of M along M’ on an interval J C R 
and o : J — I is a smooth map between intervals, then the triple 


(Woo,yoo,y' oa) 
is a motion of M along M’ on the interval J. 
Inversion If (Y, y, y’) is a motion of M along M’, then 
wy”) 
is a motion of M’ along M. 


Composition If (YW y,y') is a motion of M along M’ on an interval J and 
(W’, y’, y”) is a motion of M’ along M” on the same interval, then 


ww yy") 
is a motion of M along M”. 

We now give the three simplest examples of “bad” motions; i.e. motions which 
do not satisfy the concepts we are about to define. In all three of these examples, p 
is a point of M and M’ is the affine tangent space to M at p: 

M' := p+T,M = {p+v|veET,M}. 
Example 3.5.4 (Pure sliding) Take a nonzero tangent vector v € T,M and let 
y(t) := p, y'(t)=pt+tv, W(t) :=1. 
Then y(t) = 0, y(t) = v 4 0, and so 
Wty) # (0). 
(See Fig. 3.4.) 
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Fig. 3.4 Pure sliding M 


Fig.3.5 Pure twisting fa 


Fig. 3.6 Pure wobbling eae 


Example 3.5.5 (Pure twisting) Let y and y’ be the constant curves 
y@)=y'(t) =p 


and take W(f) to be the identity on 7,M + and any curve of rotations on the tangent 
space T,,M. As a concrete example with m = 2 and n = 3 one can take M to be 
the sphere of radius one centered at the point (0, 1,0) and p to be the origin: 


M :={(x,y, eR |x? +(y-1?+2=1),  p:=(0,0,0). 


Then M’ is the (x,z)-plane and A(t) is any curve of rotations in the (x, z)-plane, 
i.e. about the y-axis T,M+. (See Fig. 3.5.) 


Example 3.5.6 (Pure wobbling) This is the same as pure twisting except that 
W(t) is the identity on 7,M and any curve of rotations on 7, M +. As a concrete 
example with m = | andn = 3 one can take M to be the circle of radius one in the 
(x, y)-plane centered at the point (0, 1,0) and p to be the origin: 


M := {(x,y,0)€R3[x°+(y-1P =], p= (0,0, 0). 


Then M' is the x-axis and W(t) is any curve of rotations in the (y, z)-plane, i.e. 
about the axis M’. (See Fig. 3.6.) 


3.5.2 Sliding 


When a train slides on the track (e.g. in the process of stopping suddenly), there is 
a terrific screech. Since we usually do not hear a screech, this means that the wheel 
moves along without sliding. In other words the velocity of the point of contact in 
the train wheel M equals the velocity of the point of contact in the track M’. But 
the track is not moving; hence the point of contact in the wheel is not moving. One 
may explain the paradox this way: the train is moving forward and the wheel is 
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rotating around the axle. The velocity of a point on the wheel is the sum of these 
two velocities. When the point is on the bottom of the wheel, the two velocities 
cancel. 


Definition 3.5.7. A motion (Wy, y’) is said to be without sliding iff it satis- 
fies W(t)y(t) = y(t) for every ft. 


Here is the geometric picture of the no sliding condition. As explained in Re- 
mark 3.5.2 we can view a motion as a smooth family of isometries 


Vi(p) = y(t) + ¥O)(p — v@)) 


acting on the manifold M with y(t) € M being the point of contact with M’. Dif- 
ferentiating the curve tf +> w,(p) which describes the motion of the point p € M in 
the space R” we obtain 


d ‘ 
avi = y(t) —YOVO + YO(p-y@). 


Taking p = y(to) we find 


7 Velr(o)) = Y' (to) — Y (to) (to). 


t=lo 


This expression vanishes under the no sliding condition. In general the curve t b> 
W,(y(to)) will be non-constant, but (when the motion is without sliding) its velocity 
will vanish at the instant ¢ = fo; i.e. at the instant when it becomes the point of 
contact. In other words the motion is without sliding if and only if the point of 
contact is motionless. 

We remark that, if the motion is without sliding, we have: 


IYO! = |YOrO! = lrOl 


so that the curves y and y’ have the same arclength: 


fioid = [vora 


on any interval [f, t;] C 7. Hence any motion with y = 0 and yp’ ¥ 0 is not without 
sliding (such as the example of pure sliding above). 


Exercise 3.5.8 Give an example of a motion where |y(t)| = |y’(t)| for every ft but 
which is not without sliding. 
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Example 3.5.9 We describe mathematically the motion of the train wheel. Let the 
center of the wheel move right parallel to the x-axis at height one and the wheel 
have radius one and make one revolution in 27 units of time. Then the track M’ is 
the x-axis and we take 


M := {(x,y) € R?|x*°+(y- 1) = Ih. 
Choose 
y(t) := (cos(t — 1/2), 1 + sin(t — 2/2)) 
= (sin(t), | — cos(t)), 
y(t) := (0), 
and define W(t) € GL(2) by 
W(t) = cos(t) sin(t) 
“ \—sin(t) — cos(t))’ 


The reader can easily verify that this is a motion without sliding. A fixed point po 
on M, say po = (0,0), sweeps out a cycloid with parametric equations 


x =t—sin(f), y = 1-cos(f). 
(Check that (x, y) = (0,0) when y = 0;1.e. fort = 2nz.) 
Remark 3.5.10 These same formulas give a motion of a sphere M rolling without 


sliding along a straight line in a plane M’. Namely in coordinates (x, y,z) the 
sphere is given by the equation 


x? +(y—1)? +27 =1, 


the plane is y = 0 and the line is the x-axis. The z-coordinate of a point is unaffected 
by the motion. Note that the curve y’ traces out a straight line in the plane M’ and 
the curve y traces out a great circle on the sphere M. 


Exercise 3.5.11 The operations of reparametrization, inversion, and composition 
respect motion without sliding; i.e. if (WY y, y’) and (W’, y’, y”) are motions without 
sliding on an interval J ando : J — I is a smooth map between intervals, then the 
motions (Woo, yoo, y’oa), (W!, y’,y), and (WY, y, y”) are also without sliding. 


3.5.3 Twisting and Wobbling 


A motion (Y y, y’) on an intervall 7 C R transforms vector fields along y into 
vector fields along y’ by the formula 


X(t) = (WX) (t) = W)X(t) € Ty) M’ 
fort € J and X € Vect(y); so X’ € Vect(y’). 
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Lemma 3.5.12 Let (YW, y, y’) be a motion of M along M' on an interval I C R. 
Then the following are equivalent. 


(i) The instantaneous velocity of each tangent vector is normal, i.e. fort € I 
W(t)T MC TM”. 
(ii) W intertwines covariant differentiation, i.e. for X € Vect(y) 
V' (WX) = WVX., 


(iii) W transforms parallel vector fields along y into parallel vector fields along y', 
ie. for X © Vect(y) 


VX =0 = V'(WX) = 0. 
(iv) W intertwines parallel transport, i.e. for s,t € I andv € T,(s)M 
W(t)D,(t,s)v = B,(t, s)W(s)v. 


A motion that satisfies these conditions is called without twisting. 


Proof We prove that (i) is equivalent to (11). A motion satisfies the equation 
YOM yO) = TY OVO 


for every t € J. This restates the condition that W(t) maps tangent vectors of M to 
tangent vectors of M’ and normal vectors of M to normal vectors of M’. Differen- 
tiating the equation X'(t) = W(t) X(t) we obtain 


X'(t) = W(t)X (t) + W(t) X(0). 
Applying IT7’(y'(t)) this gives 
VX! = WVX $+ T'(y' WX. 


Hence (ii) holds if and only if I7’(y’(t))W(t) = 0 for every t € J. Thus we have 
proved that (i) is equivalent to (ii). That (ii) implies (iii) is obvious. 

We prove that (iii) implies (iv). Let fo €¢ J and vp € Ty )M. Define the vec- 
tor field X¥ € Vect(y) by X(t) := ,(¢, f)vo for t € J and let X’ := WX. Then 
VX = 0, hence V’X’ = 0 by (iii), and hence 


X"(t) = (6, 0) X (to) = By, (¢, to) W (to) v0 


for allt € I. Since X'(t) = W(t) X(t) = W(t) ®,(t, to) vo, this implies (iv). 
We prove that (iv) implies (ii). Let X¥ € Vect(y) and X’ := WX. By (iv) we have 


PD), (to, t)X'(t) = W(to)®, (to, t) X(t). 


Differentiating this equation with respect to ¢ at f = f and using Theorem 3.3.6, we 
obtain V’X' (to) = W(to)V X (to). This proves the lemma. Oo 
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Lemma 3.5.13 Let (W, y, y’) be a motion of M along M' on an interval I C R. 
Then the following are equivalent. 


(i) The instantaneous velocity of each normal vector is tangent, i.e. fort € I 
W)TyayMt C TyqM'. 
(ii) W intertwines normal covariant differentiation, i.e. for Y € Vect*(y) 
V-(WwY) =wv-y. 


(iii) W transforms parallel normal vector fields along y into parallel normal vector 
fields along y', i.e. for Y € Vect+(y) 


vty =0 = Vi (WY) =0. 


(iv) W intertwines parallel transport of normal vector fields, i.e. for s,t € I and 
We Ty(s)M+ 


W(t) DS (t,s)w = B(t, s)W(s)w. 
A motion that satisfies these conditions is called without wobbling. 


The proof that the four conditions in Lemma 3.5.13 are equivalent is word for 
word analogous to the proof of Lemma 3.5.12 and will be omitted. 

In summary a motion is without twisting iff tangent vectors at the point of contact 
are rotating towards the normal space and it is without wobbling iff normal vectors 
at the point of contact are rotating towards the tangent space. Incase m = 2 andn = 
3 motion without twisting means that the instantaneous axis of rotation is parallel 
to the tangent plane. 


Remark 3.5.14 The operations of reparametrization, inversion, and composition 
respect motion without twisting, respectively without wobbling; i.e. if (Wy, y’) 
and (W’, y’, y”) are motions without twisting, respectively without wobbling, on 
an interval J ando : J — J is a smooth map between intervals, then the motions 
(Woo,yoo,y'oa), (W!,y’, y), and (WW, y, y”) are also without twisting, re- 
spectively without wobbling. 


Remark 3.5.15 Let J C R be an interval and f9 € J. Given curves y : J > M and 
y’ : I = M' and an orthogonal matrix W € O(7) such that 


WoT y(9)M = Ty(9)M' 


there is a unique motion (Y, y, y’) of M along M’ (with the given y and y’) without 
twisting or wobbling satisfying the initial condition: 


W(to) = WY. 
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Indeed, the path of matrices YW : J — O(n) is uniquely determined by the condi- 
tions (iv) in Lemma 3.5.12 and Lemma 3.5.13. It is given by the explicit formula 


W(t)v = B,,(t, to) YoPy (to, tT (y(t))v 


(3.5.2) 
+ Bi(t, 10) MP} (to, t)(v — M(y(d)v) 


fort € J and v € R”. We prove below a somewhat harder result where the motion is 
without twisting, wobbling, or sliding. It is in this situation that y and y’ determine 
one another (up to an initial condition). 


Remark 3.5.16 We can now give another interpretation of parallel transport. 
Given y : R > M and vo € Tym) M take M’ to be an affine subspace of the same 
dimension as M. Let (VY y, y’) be a motion of M along M’ without twisting (and, 
if you like, without sliding or wobbling). Let X’ € Vect(y’) be the constant vector 
field along y’ (so that V’X’ = 0) with value 
X'(t) = Youo, W := W(t). 
Let X € Vect(y) be the corresponding vector field along y so that 
W(t) X(t) = Yovo 

Then X(t) = ®,(f, to)vo. To put it another way, imagine that M is a ball. To define 
parallel transport along a given curve y roll the ball (without sliding) along a plane 
M' keeping the curve y in contact with the plane M’. Let y’ be the curve traced out 
in M’. If a constant vector field in the plane M’ is drawn in wet ink along the curve 


y’, it will mark off a (covariant) parallel vector field along y in M. 


Exercise 3.5.17 Describe parallel transport along a great circle in a sphere. 


3.5.4 Development 
A development is an intrinsic version of motion without sliding or twisting. 
Definition 3.5.18 A development of VM along M’ (on an interval /) is a triple 
(PD, y, y’) where y : I > M and y’: I — M' are smooth paths and @ is a family 
of orthogonal isomorphisms 

P(t) 2 TyyM > Ty@M' 


parametrized by ¢ € J, such that 


Pt)yO) =H) Gs3) 
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for all t € J and @ intertwines parallel transport, i.e. 
P(t)®,(t,s) = PD), (t, 5) P(s) (3.5.4) 


for all s,¢ € 7. In particular, the family ® of isomorphisms is smooth, i.e. if X is a 
smooth vector field along y, then the formula X’(t) := ®(t) X(t) defines a smooth 
vector field along y’. 


Lemma 3.5.19 Let J C R be an interval, y: I > M and y': I — M' be smooth 
curves, and ®(t) : T,yyM — T,q)M' be a family of orthogonal isomorphisms 
parametrized by t € I. Then the following are equivalent. 


(i) (@, y, y’) is a development. 
(ii) @ satisfies (3.5.3) and 


V'(@X) = VX (3.5.5) 


for all X © Vect(y). 
(iii) There exists a motion (W, y, y') without sliding and twisting such that 


PK) =V(OIrm forall € 1. (3.5.6) 


(iv) There exists a motion (Wy, y') of M along M' without sliding, twisting, and 
wobbling that satisfies (3.5.6). 


Proof That (3.5.4) is equivalent to (3.5.5) was proved in Lemma 3.5.12. This (i) 
is equivalent to (ii). That (iv) implies (iii) and (iii) implies (i) is obvious. To prove 
that (i) implies (iv) choose any fo € J and any orthogonal matrix W% € O(7) such that 
WIT, 4)M = @(fo) and define W(t) : R” > R” by (3.5.2). This proves Lemma 3.5.19. 

Oo 


Remark 3.5.20 The operations of reparametrization, inversion, and composition 
yield developments when applied to developments; i.e. if (®, y, y’) is a development 
of M along M’ on an interval J, (®’, y’, y”) is a development of M’ along M” on 
the same interval J, ando : J —> I is asmooth map of intervals, then the triples 


(Boa,you,y'oo), (G'.y',y)),  (®'G,y,y") 
are all developments. 


Theorem 3.5.21 (Development Theorem) Let py) € M and ty €R, let y’: R— M’ 
be a smooth curve, and let 


Do : TM = Ty1()M’ 


be an orthogonal isomorphism. Then the following holds. 
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(i) There exists a development (®, y, y’|,) on some open interval I CR containing 
to that satisfies the initial condition 


y (to) = Po, P(to) = Do. (3.5.7) 


(ii) Any two developments (®,, 1, y'|1,) and (®2, 2, y'|z,) as in (i) on two inter- 
vals I, and I, agree on the intersection I, 0 1, i.e. 


vi(t) = y2(t), P(t) = Py (t) 


foreveryt € 1,01. 
(iii) If M is complete, then (i) holds with I = R. 


Proof Let y : R — M be any smooth curve such that 
y (to) = Po 
and, for t € R, define the linear map 
P(t) : T,~M => TyyM' 
by 
P(t) := PD), (t, 19) PoP, (to, t). (3.5.8) 
This is an orthogonal transformation for every ¢ and it intertwines parallel transport. 
However, in general ®(t) y(t) will not be equal to y’(t). 
To construct a development that satisfies (3.5.3), we choose an orthonormal 
frame ey : R” — T,,M and, fort € R, define e(t) : R” > T,q)M by 
e(t) := D,(t, to)eo. (3.5.9) 
We can think of e(f) as a realm x m-matrix and the map 


R aS Ree : t be e(t) 


is smooth. In fact, the map t +> (y(t), e(t)) is a smooth path in the frame bun- 
dle #(M). Define the smooth map & : R > R” by 


V(t) = B,,(t, to) Poe (0). (3.5.10) 
We prove the following. 


Claim: The triple (®, y, y') is a development on an interval I C R if and only if the 
path t +> (y(t), e(t)) satisfies the differential equation 


(V(t), et) = Be(v@, e() (3.5.11) 
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for every t € I, where Bey) € Vect(F (M )) denotes the basic vector field associated 
to &(t) € R” (see equation (3.4.9)). 


The triple (&, y, y’) is a development on J if and only if 
P(t)y(t) = y'() 
for every t € J. By (3.5.8) and (3.5.10) this is equivalent to the condition 
Dy (t, to) PoP, (to, ty (t) = y(t) = B,,(t, to) Boeck (1), 

hence to 

Py (to, CY (0) = eo§ (2), 
and hence to 

Y(t) = y(t, to)eo§(t) = e(tE(t) (3.5.12) 

for every t € J. By (3.5.9) and the Gaui—Weingarten formula, we have 

e(t) = hyo Oe 


for every t € R. Hence it follows from (3.4.9) that (3.5.12) is equivalent to (3.5.11). 
This proves the claim. 

Parts (i) and (ii) follow directly from the claim. Part (iii) follows from the claim 
and Definition 3.4.12. This proves Theorem 3.5.21. Oo 


Remark 3.5.22 As any two developments (®;, 7), y’|7,) and (®2, y2, y’|7,) on two 
intervals 7, and J, that satisfy the initial condition (3.5.7) agree on J, M J, there is 
a development defined on J; U J. Hence there is a unique maximally defined de- 
velopment (®, y, y’|r), defined on a maximal interval J = I(to, po, ®o), associated 
to the initial data fo, po, Bo. 

Denote the space of initial data by 


P := \(t, p,®) ’ €R,p¢M,®:T,M = Ty aM (3.5.13) 
is an orthogonal transformation 
define the set D C R x P by 
D:= {(t, to, Po: Po) | (to, Po, Po) € P, t € (to, po, Lo)} (3.5.14) 


and let 


D — P : (t,to, Po, Po) > (t, y(t), @(4)), (3.5.15) 
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be the map which assigns to each (fo, po, oy) € P and each t € I(t, po, Bo) the 
value at time ¢ of the unique development (9, y, y’|;) associated to the inital condi- 
tion (fo, Po, po) on the maximal time interval J = I(to, po, ®o). Then the space P 
has a natural structure of a smooth manifold (in the intrinsic setting), and it fol- 
lows from Theorem 2.4.9 and the proof of Theorem 3.5.21 that D is an open subset 
of R x P and the map (3.5.15) is smooth. 

The smooth structure on P can be understood as follows. The space 


O(y’) = {(t,e) | (ve) € O(M)} 


is the pullback of the orthonormal frame bundle O(M’) > M’ under the curve 
y’: R > M‘ or, equivalently, is the orthonormal frame bundle of the pullback tan- 
gent bundle (y’)*7M. Thus O(y’) is a smooth submanifold of R x R’*’”". The 
group O(m) acts diagonally on O(y’) x O(M) and the action is free. Hence the 
quotient (O(y’) x O(M))/O(m) is a smooth manifold by Theorem 2.9.14, and it 
can be naturally identified with P via the bijection [(t, e’), (p,e)] > (t, p,e’ oe). 


Remark 3.5.23 The statement of Theorem 3.5.21 is essentially symmetric in M 
and M’ as the operation of inversion carries developments to developments. Hence 
given 


y:R- M, POEM’, to ER, Do 2 Ty(—))M > Ty M', 
we may speak of the development (9, y, y’) corresponding to y with initial condi- 


tions y'(to) = pp and ®(to) = Do. 


Corollary 3.5.24 (Motions) Let po € M and to € R, let y' : R > M’ beasmooth 
curve, and let Wy € O(n) be a matrix such that 


YT pM = Tyr). 
Then the following holds. 


(i) There exists a motion (W,y,y'|;) without sliding, twisting and wobbling 
on some open interval I C R containing to that satisfies the initial condi- 
tion y(to) = po and V(t) = W. 

(ii) Any two motions as in (i) on two intervals I, and I agree on the intersection 
I, NI. 

(iii) If M is complete, then (i) holds with I = R. 


Proof Theorem 3.5.21 and Remark 3.5.15. O 


Corollary 3.5.25 (Completeness) The following are equivalent. 


(i) M is complete, i.e. for every smooth curve — : R > R” and every ele- 
ment (po,eo) € F(M), there exists a smooth curve B:R— F(M) such 
that B(t) = Beqa)(B(t)) for all t € R and B(O) = (po, eo) (Definition 3.4.12). 
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(ii) For every smooth curve & : R > R"” and every element (po, €o) € O(M), there 
is asmooth curve a : R > O(M) such that a(t) = Beq)(a(t)) for everyt € R 
and (0) = (po, 0). 

(iii) For every smooth curve y’ : R > R”, every po € M, and every orthogonal 
isomorphism ®o : T,,M — IR” there exists a development (®, y, y') of M 
along M' = R"” on all of R that satisfies y(0) = po and &(0) = Do. 


Proof We have already noted that the basic vector fields are all tangent to the 
orthonormal frame bundle O(M) C #(M). Now note that if a smooth curve 
I> F(M): tb B(t) = (v(t), e(t)) on an interval J C R satisfies the differen- 
tial equation 


B(t) = Be (BO) 
for all t, then so does the curve 
I > F(M):t > a*B(t) = (V(t), e(t) 0a) 

for every a € GL(m,R). Since any frame eo : R” — T,,M can be carried to any 
other (in particular an orthonormal one) by a suitable matrix a € GL(m,R), this 
shows that (1) is equivalent to (ii). 

That (i) implies (iii) was proved in Theorem 3.5.21. 

We prove that (iii) implies (ii). Fix a smooth map € : R — R” and an ele- 
ment (po, eo) € O(M). Define 

By := €5': TM > R™ 


and 
t 
y(t) := : &(s)ds € R” fort ER. 
0 
By (ii) there exists a development (®, y, y’) of M along R” on all of R that satisfies 
the initial conditions 
yO) = po, (0) = %. 
Then 
P(t) = DoF, (0,1): TyM>R",  SPOHPO=VO=EO 
for all t € R by Definition 3.5.18. Define 


e(t) := D,(t, 0)ep = O(t) |: R™ > T,wyM 
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fort € R. Then (y, e) : R ~ F(M) is a smooth curve that satisfies the initial con- 
dition (y(0), e(0)) = (Po, eo) and the differential equation 


y(t) = Ot) EC) = eEM), 
E(t) = hy (Ye) = hyw(eMEO) es) 
by the Gaui—Weingarten formula. Thus 
(70), €(@)) = Bem(v@, e@)) 
for all ¢ € R. This proves Corollary 3.5.25. O 


It is of course easy to give an example of a manifold which is not complete; 
e.g. if (®, y, y’) is any development of M along M’, then M \ {y(t1)} is not com- 
plete as the given development is only defined for t 4 t,. In Sect. 4.6 we give 
equivalent characterizations of completeness. In particular, we will see that any 
compact submanifold of IR” is complete. 


Exercise 3.5.26 An affine subspace of R” is a subset of the form 
E=p+E={p+t+ov|vek} 


where E C R” is a linear subspace and p € R”. Prove that every affine subspace 
of R” is a complete submanifold. 


3.6 Christoffel Symbols 


The goal of this subsection is to examine the covariant derivative in local coordi- 
nates on an embedded manifold M C R"” of dimension m. Let 
g¢:U>-2 
be a coordinate chart, defined on an M-open subset U C M with values in an open 
set 22 C R”, and denote its inverse by 
vi=go!:2>UCM. 


At this point it is convenient to use superscripts for the coordinates of a vector 
x € 92. Thus we write 
= eee cl EQ. 


If p = w(x) € U is the corresponding element of M, then the tangent space of M 
at p is the image of the linear map dw(x) : R” — R" (Theorem 2.2.3) and thus 
two tangent vectors v, w € T,M can be written in the form 


oy 
ox! 


v= dy(x)g =) 1s F(a), 
me (3.6.1) 


m : a 
w=dv@yn= on) 
j=l 
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Fig. 3.7 A vector field along 
a curve in local coordinates 


U 
M vf | 
9 
for € = (€!,...,&”) € R” and n = (n',..., 7") € R”. Recall that the restriction 


of the inner product in the ambient space R” to the tangent space is the first funda- 
mental form g, : 7,M x T,M — R (Definition 3.1.1). Thus 


m 


gp(v,w) = (v,w) = D> Ei gij(x)n!, (3.6.2) 


ij=l 
where the functions gj; : 82 — R are defined by 


oy 
ox! 


a] 
Bij (x) i= ( (x), | forx € Q. (3.6.3) 


In other words, the first fundamental form is in local coordinates represented by the 
matrix valued function g = (gjj)j;-) : 2 > R"™". 

Now let c = (c!,... ,c') : IT — 2 be asmooth curve in 92, defined on an inter- 
val J C R, and consider the curve 


y=woc:Il—M 


(see Fig. 3.7). Our goal is to describe the operator X +» VX on the space of vector 
fields along y in local coordinates. Let X : J — R” be a vector field along y. Then 


X(1) € Ty)M = Tye) M = im (dy(c(t)) : R" > R") 
for every t € J and hence there exists a unique smooth function 
&= (é',...,€"): 1 > R™ 


such that 


m 


.  ¢O 
X() =dyeapen = ren 
1=1 


ax! 


(c(t). (3.6.4) 
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Differentiate this identity to obtain 


m m 


X() = dF Oe Ye) +) FOHOs oa (c(t)). (3.6.5) 


i,j=l 


We examine the projection VX(t) = I(y(t))X(t) of this vector onto the tan- 
gent space of M at y(t). The first summand on the right in (3.6.5) is already 
tangent to M. For the second summand we simply observe that the vector 
IT(w(x))0?w/dx'dx/ lies in tangent space Tyq)M and can therefore be expressed 
as a linear combination of the basis vectors dy/dx!,...,0w/dx". The coefficients 
will be denoted by [jj K(x). Thus there exist smoot fonctions is 92 — R for 
i,j,k =1,...,m gush tliat 


m 


VQ) = 2 Koy Yo) (3.6.6) 


ayer 


for all x € Q andalli, j € {1,...,m}. The coefficients ry : 2 — R are called the 
Christoffel symbols associated to the coordinate chart ¢ : U > (2. To sum up we 
have proved the following. 


Lemma 3.6.1 Letc: I — 92 be asmooth curve and define 
yi=woc:Il—M. 


If&:I +R" isasmooth map and X € Vect(y) is given by (3.6.4), then its covariant 
derivative at time t € I is given by 


m m 


V(t) = Df EK) + DO CME (EO 


k=1 ij=l 


oy 


age CO)» (3.6.7) 


where the rk are the Christoffel symbols defined by (3.6.6). 


Our next goal is to understand how the Christoffel symbols are determined by 
the metric in local coordinates. Recall from equation (3.6.2) that the inner products 
on the tangent spaces inherited from the standard Euclidean inner product on the 
ambient space R” are in local coordinates represented by the matrix valued function 


g= = (gi; = iL -Q => Rm 


given by 


oy oO 
8ij = ( y a : (3.6.8) 
R" 


ax! ax/ 


We shall see that the Christoffel symbols are completely determined by the func- 
tions gj; : S2 — R. Here are first some elementary observations. 
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Remark 3.6.2 The matrix g(x) € R””*”” is symmetric and positive definite for 
every x € 92. This follows from the fact that the matrix dy(x) € R”*” has rank m 
and the matrix g(x) is given by 


g(x) = dy(x)"dy(x) 
Thus €'g(x)& = |dw(x)é|* > 0 for all € € R” \ {0}. 
Remark 3.6.3. For x € 92 we have det(g(x)) > 0 by Remark 3.6.2 and so the 


matrix g(x) is invertible. Denote the entries of the inverse matrix g(x)~! € R”™*” 
by g*"(x). They are determined by the condition 


m 


Say Qe" =o = 


jot 


l, ifi=k, 
0, iff 4k. 


Since g(x) is symmetric and positive definite, so is its inverse matrix g(x)~!. In 
particular, we have g(x) = g(x) forall x € Q andallk,@ € {1,...,m}. 


Remark 3.6.4 Suppose that X,Y € Vect(y) are vector fields along our curve 
y=woc:I—>M andé,n: I] > R"” are defined by 


m ; F) m : a 
xH= KOT EO, YO= LOX. 
i=l 


j=l 


Then the inner product of X(t) and Y(t) is given by 


m 


(X(),. YO) = Yo EF Ogi (en! (0. 


ij=l 


Lemma 3.6.5 (Christoffel symbols) Let 922 Cc R” be an open set and let 
gij 1 2—>R for i,j =1,...,m be smooth functions such that each matrix 
(gi; (x) ia is symmetric and positive definite. Let i : 2 — R be smooth 


functions fori, j,k = 1,...,m. Then the ry satisfy the conditions 


m 


Ogi; ; 
Math Soe = i( sal + set) 3.6.9) 
k=1 


fori, j,k,€=1,...,m if and only if they are given by 


m 


: al ( Ogi Oge; 98: 
Wa os: : ke J J 
| i= a g 5 ( Axi + Axi _ ax F (3.6.10) 


If the ry are defined by (3.6.6) and the gj; by (3.6.8), then the rk satisfy (3.6.9) 
and hence are given by (3.6.10). 
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Proof Suppose that the ri are given by (3.6.6) and the g;; by (3.6.8). Let 
c: I>, —,n: I > R™ 
be smooth functions and suppose that the vector fields X, Y along the curve 
yi=ypoc:Il>~M 
are given by 
X(t) = ye (ew), Y= > n! Gree ¥ (et). 
i=l j=l 


Dropping the argument ¢ in each term, we obtain from Remark 3.6.4 and 
Lemma 3.6.1 that 


= yo sii (c)E'n’, 
tJ 
(X, VY) = Desie(os i oe pone |, 


(VX,Y) = Lew) B+ DTH OEE Ia Z 


Hence it follows from equation (3.2.5) in Lemma 3.2.4 that 


d 
0 = TAX, ¥) — (x, VY) —(VX,Y¥) 


= (stot + gi (cE i! +h “Bi ( JE! n/ at 


1] 


~ 2a gH OE K— ~ gilOlié'nié 


i,j,k e 
Eat (En — > ga (Mi (ce! n! 
dK ij ke 
dg; 
= -»(% SY (¢) — DB OF EO) - Tenteortton eal 
Lit 


This holds for all smooth maps c : J > 9 and —,n: I > R”, so the rf satisfy 
the second equation in (3.6.9). That they are symmetric ini and 7 is obvious. 
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To prove that (3.6.9) is equivalent to (3.6.10), define 


Thi; = Sealy: 
k=1 
Then (3.6.9) is equivalent to 


d8ij 
Peig = Vaji, oe = Tye + Tjie- 


and (3.6.10) is equivalent to 


1 ( Og¢i Oge; Ogi 
Ig; = - — — : 
ma (#5 ae. a 


If the [;; are given by (3.6.13), then they satisfy 


Viz = Lei 
and 

Ogi; , O8ie  O8je | O8ji | Ajo — A8ie 
Wet ie = 2 peer J i 
pe et gee axé - axi Oxi * ax? - axt xs 

d8ij 

= 224 

ax? 


for all 7, j, . Conversely, if the I%;; satisfy (3.6.12), then 


Oi; r 

aa — pe + jils 

Ogi =ly,,+]1 
aad Lea tig = Fey + Pye, 
OLej = | 
a = Py + Tjei = Tey + Thue 


Take the sum of the last two minus the first of these equations to obtain 


O8¢i Oge; Ogi; 
- —— = 214;;. 
axJ oe ax! = ax# aj 
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(3.6.11) 


(3.6.12) 


(3.6.13) 


Thus (3.6.12) is equivalent to (3.6.13) and so (3.6.9) is equivalent to (3.6.10). This 


proves Lemma 3.6.5. 


oO 
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3.7. Riemannian Metrics* 


We wish to carry over the fundamental notions of differential geometry to the in- 
trinsic setting. First we need an inner product on the tangent spaces to replace the 
first fundamental form in Definition 3.1.1. This is the content of Definition 3.7.1 
and Lemma 3.7.4 below. Second we must introduce the covariant derivative of a 
vector field along a curve. With this understood all the definitions, theorems, and 
proofs in this chapter carry over in an almost word by word fashion to the intrinsic 
setting. 


3.7.1 Existence of Riemannian Metrics 


We will always consider norms that are induced by inner products. But in general 
there is no ambient space that can induce an inner product on each tangent space. 
This leads to the following definition. 


Definition 3.7.1 Let M be a smooth m-manifold. A Riemannian metric on M is 
a collection of inner products 


T,M xT,M > R:(v,w) g,(v,w), 
one for every p € M, such that the map 


M —>R: pr g,(X(p), Y(p)) 


is smooth for every pair of vector fields X, Y € Vect(M). We will also denote the 
inner product by (v,w), and drop the subscript p if the base point is understood 
from the context. A smooth manifold equipped with a Riemannian metric is called 
a Riemannian manifold. 


Example 3.7.2. If M C R” is a smooth submanifold, then a Riemannian metric 
on M is given by restricting the standard inner product on R” to the tangent spaces 
T,M CR". This is the first fundamental form of an embedded manifold (see Defi- 
nition 3.1.1). 


More generally, assume that M is a Riemannian m-manifold in the intrinsic 
sense of Definition 3.7.1 with an atlas A = {(¢y, Uw)}aea- Then the Riemannian 
metric g determines a collection of smooth functions 


La = (Sa,ij i=l ‘ da (Ue) —> Ree. (3.7.1) 
one for each a € A, defined by 


ETgy(x)n:= gp(v,w), du(p) =x, adda(p)v =& doa(p)w =n, (3.7.2) 


for x € dy(Uy) and &,n € R”. 
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Each matrix gy(x) is symmetrix and positive definite. Note that the tangent vec- 
tors v and w in (3.7.2) can also be written in the form 


v = [a, E]p, w = [a, nlp. 


Choosing standard basis vectors 


& = @, n= ej 
in R” we obtain 
(ily = do(p)-'es =: op) 
and hence 
re 2 
suis) = (550) 55 a'))} G73) 


For different coordinate charts the maps g, and gz are related through the transition 
map 


dpa ‘= $8 ° Py: bu(Ua M Up) > $p(Ua O Up) 

via 

Sa(x) = dbpa(x)' ga(bpa(x))dbpa(x) (3.7.4) 
for x € dy(Uyg M Ug). Equation (3.7.4) can also be written in the shorthand notation 

Sa = bpa8p 
fora, B € A. 
Exercise 3.7.3. Every collection of smooth maps 
8a? ba(Ua) > R 


with values in the set of positive definite symmetric matrices that satisfies (3.7.4) 
for all w, 6 € A determines a global Riemannian metric via (3.7.2). 


In this intrinsic setting there is no canonical metric on M (such as the metric 
induced by R” on an embedded manifold). In fact, it is not completely obvious that 
a manifold admits a Riemannian metric and this is the content of the next lemma. 


Lemma 3.7.4 Every paracompact Hausdorff manifold admits a Riemannian met- 
ric. 
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Proof Let m be the dimension of M and let A = {(¢g, Ux)}uea be an atlas on 
M. By Theorem 2.9.9 there is a partition of unity {6,}ve4 subordinate to the open 
cover {Uy tvea. Now there are two equivalent ways to construct a Riemannian met- 
ricon M. 

The first method is to carry over the standard inner product on R” to the tangent 
spaces 7, M for p € U, via the coordinate chart @,, multiply the resulting Rieman- 
nian metric on U, by the compactly supported function 6,, extend it by zero to all 
of M, and then take the sum over all a. This leads to the following formula. The 
inner product of two tangent vectors v, w € T, M is defined by 


(v, w)p = )~ Oa(p)(dgalp)v, dba(p)w), (3.7.5) 


peug 


where the sum runs over all w € A with p € U, and the inner product is the standard 
inner product on R”. Since supp(6.) C Uy for each w and the sum is locally finite 
we find that the function 


M —>R: p® (X(p),Y(p))p 


is smooth for every pair of vector fields X, Y € Vect(M). Moreover, the right hand 
side of (3.7.5) is symmetric in v and w and is positive for v = w 4 0 because 
each summand is nonnegative and each summand with 6,(p) > 0 is positive. Thus 
equation (3.7.5) defines a Riemannian metric on M. 

The second method is to define the functions 


Se 2 by (Ux) > RO™ 
by 


SalX) = D> Oy (G,'(&))dbyalx)"ddya(x) (3.7.6) 


yeA 


for x € y(Ua) where each summand is defined on dy (Ug M U,) and is understood 
to be zero for x ¢ da(Ug M U,). We leave it to the reader to verify that these func- 
tions are smooth and satisfy the condition (3.7.4) for all a, B € A. Moreover, the 
formulas (3.7.5) and (3.7.6) determine the same Riemannian metric on M. (Prove 
this!) This proves Lemma 3.7.4. Oo 


3.7.2 Two Examples 


Example 3.7.5 (Fubini-Study metric) The complex projective space carries 
a natural Riemannian metric, defined as follows. Identify CP” with the quo- 
tient of the unit sphere S?”+! C C+! by the diagonal action of the circle S', 
ic. CP” = $?”*!/S1. Then the tangent space of CP” at the equivalence class 


[Zz] = [Zo : +++: Zn] € CP” 
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of a point z = (Zo,...,Z,) € S7”*! can be identified with the orthogonal com- 
plement of Cz in C"*!. Now choose the inner product on 7;,;CP” to be the one 
inherited from the standard inner product on C”*! via this identification. The re- 
sulting metric on CP” is called the Fubini-Study metric. Exercise: Prove that the 
action of U(n + 1) on C”*! induces a transitive action of the quotient group 


PSU(n + 1) := U(n + 1)/S! 
by isometries. If z ¢ S?”*!, prove that the unitary matrix 
g:=2zz*-1 


descends to an isometry ¢ on CP” with fixed point p := [z] and dé(p) = —id. 
Show that, in the case n = 1, the pullback of the Fubini—Study metric on CP! under 
the stereographic projection 
> 1 X, + 1X2 
S* \ {(0,0, 1)} > CP’ \ {[O: 1}: (1, %2,%3) BY 1: = 
— X3 


is one quarter of the standard metric on S?. 


Example 3.7.6 Think of the complex Grassmannian G;,(C") of k-planes in C” as 
a quotient of the space 


Fi(C") = {D € crxk | D*D= 1} 


of unitary k-frames in C” by the right action of the unitary group U(k). The space 
#,(C") inherits a Riemannian metric from the ambient Euclidean space C”**. 
Show that the tangent space of G,(C”) at a point A = imD, with D € F,(C”) 
can be identified with the space 


TiGKe”) = \D ec | D*D = OF 


Define the inner product on this tangent space to be the restriction of the standard 
inner product on C”* to this subspace. Exercise: Prove that the unitary group U(n) 
acts on G;,(C”) by isometries. 


3.7.3. The Levi-Civita Connection 


A subtle point in this discussion is how to extend the notion of covariant derivative 
to general Riemannian manifolds. In this case the idea of projecting the derivative 
in the ambient space orthogonally onto the tangent space has no obvious analogue. 
Instead we shall see how the covariant derivatives of vector fields along curves can 
be characterized by several axioms and these can be used to define the covariant 
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derivative in the intrinsic setting. An alternative, but somewhat less satisfactory, 

approach is to carry over the formula for the covariant derivative in local coordinates 

to the intrinsic setting and show that the result is independent of the choice of the 

coordinate chart. Of course, these approaches are equivalent and lead to the same 

result. We formulate them as a series of exercises. The details are straightforward. 
Assume throughout that M is a Riemannian m-manifold with an atlas 


A = {(ba, Ua) Sued 
and suppose that the Riemannian metric is in local coordinates given by 
8a = (Sais) jai: Pa(Ua) > RR 
for a € A. These functions satisfy (3.7.4) for all a, B € A. 


Definition 3.7.7 Let f : N — M beasmooth map between manifolds. A vector 
field along f is a collection of tangent vectors 


X(q) € Tr M, 
one for each g € N, such that the map 


N—>TM :q' (f(g), X@)) 


is smooth. The space of vector fields along f will be denoted by Vect(/). 


As before we will not distinguish in notation between the collection of tangent 
vectors X(q) € Ty(q)M and the associated map N — TM and denote them both by 
X. The following theorem introduces the Levi-Civita connection as a collection of 
linear operators V : Vect(y) — Vect(y), one for each smooth curve y : J > M. 


Theorem 3.7.8 (Levi-Civita connection) There exists a unique collection of lin- 
ear operators 


V : Vect(y) — Vect(y) 


(called the covariant derivative), one for every smooth curve y : I — M onan open 
interval I C R, satisfying the following axioms. 


(Leibniz Rule) For every smooth curve y : I > M, every smooth function 1: I > R, 
and every vector field X € Vect(y), we have 


V(AX) =AX + AVX. (3.7.7) 


(Chain Rule) Let 822 C R” be an open set, let c: I > 2 be a smooth curve, 
let y : 82 — M be a smooth map, and let X be a smooth vector field along y. 
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Denote by V;X the covariant derivative of X along the curve x' +> y(x) (with 
the other coordinates fixed). Then V;X is a smooth vector field along y and the 
covariant derivative of the vector field X 9c € Vect(y 0 c) is 


V(X oc) = )o WV X(C(). (3:7.8) 


j=l 
(Riemannian) For any two vector fields X,Y € Vect(y) we have 


d 

ae? = (VX, Y)+ (X, VY). (3.7.9) 
(Torsion-free) Let I, J C R be open intervals and y : I x J — M be a smooth map. 
Denote by V, the covariant derivative along the curve s +> y(s,t) (with t fixed) and 
by V, the covariant derivative along the curve t +> y(s,t) (with s fixed). Then 


Vs:y = V;dsy. (3.7.10) 


Proof The proof is based on a reformulation of the axioms in local coordinates. 
The (Leibniz Rule) and (Chain Rule) axioms assert that the covariant derivative 
is in local coordinates given by Christoffel symbols ry as in equation (3.6.7) in 
Lemma 3.6.1. The (Riemannian) and (Torsion-free) axioms assert that the Christof- 
fel symbols satisfy the equations in (3.6.9) and hence, by Lemma 3.6.5, are given 
by (3.6.10). (See also Exercise 3.7.10.) This proves Theorem 3.7.8. i 


Exercise 3.7.9 The Christoffel symbols of the Riemannian metric are the func- 
tions P*.. : dy(Uy) > R. defined by 


aif 


m 


1(Oga0i . 98a0; Iai; 
= ne ae ce 3.7.11 
on D8 3( Ox/ ™ ox! ax! ) ( ) 


(see Lemma 3.6.5). Prove that they are related by the equation 


k! 2 i! i 
2960 pk = Pb Fa +h fy 


axk 4 ~~ Axidxi ax! ax/ 


for alla, B € A. 


Exercise 3.7.10 Denote We := ¢,!: ¢y(Ux) > M. Prove that the covariant 
derivative of a vector field 


OWe 
sci (u(t) 


XO => EO 
i=l 
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along y = Wa 0 Cy : 1 — M is given by 


m 


VX) => [EO + OE CO EOGO 


k=1 ij=l 


IVa 


ax* 


(cu(t)). (3.7.12) 


Prove that V X is independent of the choice of the coordinate chart. 


Exercise 3.7.11 Let 2 C R? be open and A : 2 — (0,00) be a smooth function. 
Let g : Q > R?*? be given by 


,_ (Aw) 0 
co =( 0 ra 


Compute the Christoffel symbols rk via (3.6.10). 


Exercise 3.7.12 Let #: S* \{(0,0, 1)} > C be the stereographic projection, given 


by 
_ Pi P2 
oP) = (> ) 


Prove that the metric g : R* — R?*? has the form g(x) = A(x)1 where 


4 


came ESTO 


for x = (x!,x”) € R?. 


3.7.4 Basic Vector Fields in the Intrinsic Setting 


Let M be a Riemannian m-manifold with an atlas A = {(¢y, Ue) }yc4. Then the 
frame bundle (3.4.1) admits the structure of a smooth manifold with the open cover 
U, := a~!(U,) and coordinate charts 


by 1 Ua > ba(Uy) x GL(m) 
given by 
Py(P.e) := (ba(P), dha(p)e). 


The derivatives of the horizontal curves in Definition 3.4.6 form a horizontal sub- 
bundle H Cc T ¥ (M) of the tangent bundle of the Frame bundle whose fibers H,,.) 
over an element (p,e) € F(M) can in local coordinates be described as follows. 
Let 


xX := da(p), a := doa(p)e € GL(mn), (3.7.13) 
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and let (x,a@) € R” x R”*"". This pair has the form 


(%,@) = do, (p.e(P.2), — (B.2) € Hye, (3.7.14) 
if and only if 
a =— Tk 8a} (3.7.15) 
ij=l 


fork, £=1,...,m, where the functions iar : by (U,,) > R are the Christoffel sym- 
bols defined by (3.7.11). Thus a tangent vector (p,@) € T,p,2)F (M) is horizontal 
if and only if its coordinates (x,@) in (3.7.14) satisfy (3.7.15). Hence, for every 
vector € € R”, there exists a unique horizontal vector field By € Vect(F (M)) (the 


basic vector field associated to €) such that 


dm(p.e)Be(p.e) = e& 


for all (p,e) € F(M). This vector field assigns to a pair (p,e) € F(M) with 
the coordinates (x,a) € R” x GL(m) as in (3.7.13) the horizontal tangent vec- 
tor (p,e) € Aipe) C Tipe) F(M) whose coordinates (x,@) € R” x R”*” sat- 
isfy (3.7.15) and ¥ = aé. 


Exercise 3.7.13 Verify the equivalence of (3.7.14) and (3.7.15). Prove that the 
notion of a horizontal tangent vector of #(M) is independent of the choice of the 
coordinate chart. Hint: Use Exercise 3.7.9. 


Exercise 3.7.14 Examine the orthonormal frame bundle O(M) in the intrinsic 
setting. 


Exercise 3.7.15 Carry over the proofs of Theorem 3.3.4, Theorem 3.3.6, and The- 
orem 3.5.21 to the intrinsic setting. 


Check for 
updates 


Geodesics 


This chapter introduces geodesics in Riemannian manifolds. It begins in Sect. 4.1 
by introducing geodesics as extremals of the energy and length functionals and 
characterizing them as solutions of a second order differential equation. In Sect. 4.2 
we show that minimizing the length with fixed endpoints gives rise to an intrinsic 
distance function d : M x M — R which induces the topology M inherits from 
the ambient space R”. Section 4.3 introduces the exponential map, Sect. 4.4 shows 
that geodesics minimize the length on short time intervals, Sect. 4.5 establishes 
the existence of geodesically convex neighborhoods, and Sect. 4.6 shows that the 
geodesic flow is complete if and only if (MM, d) is a complete metric space, and that 
in the complete case any two points are joined by a minimal geodesic. Section 4.7 
discusses geodesics in the intrinsic setting. 


4.1 Length and Energy 


This section explains the length and energy functionals on the space of paths with 
fixed endpoints and introduces geodesics as their extremal points. 


4.1.1. The Length and Energy Functionals 


The concept of a geodesic in a manifold generalizes that of a straight line in 
Euclidean space. A straight line has parametrizations of form fh p+ o(t)v 
where o : R — R is a diffeomorphism and p,v € R”. Different choices of o yield 
different parametrizations of the same line. Certain parametrizations are preferred, 
for example those parametrizations which are “proportional to the arclength’, i.e. 
where o(t) = at + b for constants a,b € R, so that the tangent vector a(t)uv has 
constant length. The same distinctions can be made for geodesics. Some authors use 
the term geodesic to include all parametrizations of a geodesic while others restrict 
the term to cover only geodesics parametrized proportional to the arclength. We 
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follow the latter course, referring to the more general concept as a “reparametrized 
geodesic”. Thus a reparametrized geodesic need not be a geodesic. 
We assume throughout that M C R” is a smooth m-manifold. 


Definition 4.1.1 (Length and energy) Let J = [a,b] C R be a compact interval 
with a < b and let y : J — M bea smooth curve in M. The length L(y) and the 
energy Ey) are defined by 


b 

Ly) = | polat. (4.1.1) 
; ; 

E(y) := 5 [wore (4.1.2) 


A variation of y is a family of smooth curves y, : J — M, where s ranges over the 
reals, such that the map R x J > M : (s,t) + y,(t) is smooth and yo = y. The 
variation {y,;};eR is said to have fixed endpoints iff y,(a) = y(a) and y,(b) = y(b) 
foralls eR. 


Remark 4.1.2 The length of a continuous function y : [a,b] > R” can be 
defined as the supremum of the numbers yy) — y(t;-1)| over all parti- 
tions d = fo < t) <---<ty =) of the interval [a,b]. By a theorem in first year 
analysis [64] this supremum is finite whenever y is continuously differentiable and 
is given by (4.1.1). 


We shall sometimes suppress the notation for the endpoints a,b € 7. When 
y(a) = p and y(b) = q we say that y is a curve from p to g. One can always 
compose y with an affine reparametrization t/ = a + (b — a)t to obtain a new curve 
y'(t) := y(t’) on the unit interval 0 < t < 1. This new curve satisfies L(y’) = L(y) 
and E(y') = (b—a)E(y). More generally, the length L(y), but not the energy 
E(y), is invariant under reparametrization. 


Remark 4.1.3 (Reparametrization) Let J = [a,b] and J’ = [a’, b'| be compact 
intervals. If y : J > R” is asmooth curve anda : J’ > J is asmooth function such 
that o(a’) = a,o(b’) = b, and d(t') > 0 forall t’ € J’, then the curves y and y oo 
have the same length. Namely, 


Db’ Lb! 
d 
L(yoo) = [|geree| ar = [prowm|eey dt'=L(y). (4.1.3) 


a’ 


Here second equality follows from the chain rule and the third equality follows from 
the change of variables formula for the Riemann integral. 
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Theorem 4.1.4 (Characterization of geodesics) Let J = [a,b] C R be a com- 
pact interval and let y : I —-> M bea smooth curve. Then the following are equiv- 
alent. 


(i) y is an extremal of the energy functional, i.e. every variation {ys}ser of Y 
with fixed endpoints satisfies 


4| jE (ys) = 0. 


(ii) y is parametrized proportional to the arclength, i.e. the velocity |y(t)| =c = 0 
is constant, and either y is constant, i.e. y(t) = p = q forallt € I,orc >0 
and y is an extremal of the length functional, i.e. every variation {Y5}scR Of Y 
with fixed endpoints satisfies 


£ veg lt (Ve) — 0. 


(iii) The velocity vector of y is parallel, i.e. Vy(t) = Oforallt € I. 

(iv) The acceleration of y is normal to M, i.e. y(t) L TywyM forallt € 1. 

(v) If (@,y,y') is a development of M along M’' = R", then y’: I > R” is a 
straight line parametrized proportional to the arclength, i.e. py’ = 0. 


Proof See Sect. 4.1.3. oO 


Definition 4.1.5 (Geodesic) A smooth curve y : J — M onan interval J is called 
a geodesic iff its restriction to each compact subinterval satisfies the equivalent 
conditions of Theorem 4.1.4. So y is a geodesic if and only if 


Vy(t)=0  forallt eZ. (4.1.4) 


By the GauB—-Weingarten formula (3.2.2) with X = y this is equivalent to 


H(t) =hyw Q(t). yO) forall t eT. (4.1.5) 


Remark 4.1.6 


(i) The conditions (i) and (ii) in Theorem 4.1.4 are meaningless when J is not 
compact because then the curve has at most one endpoint and the length and 
energy integrals may be infinite. However, the conditions (iii), (iv), and (v) in 
Theorem 4.1.4 are equivalent for smooth curves on any interval, compact or 
not. 

(ii) The function s + E(y;) associated to a smooth variation is always smooth and 
so condition (i) in Theorem 4.1.4 is meaningful. However, more care has to be 
taken in part (ii) because the function s +> L(y;) need not be differentiable. It 
is differentiable at s = 0 whenever y(t) 4 0 forall t € J. 
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4.1.2 The Space of Paths 


Fix two points p,q € M and acompact interval J = [a, b] and denote by 
Qyq t= Rpg) := {y: 1 > M | y is smooth and y(a) = p, y(b) = q} 


the space of smooth curves in M from p to q, defined on the interval 7. Then the 
length and energy are functionals L, F : 2, — R and their extremal points can 
be understood as critical points as we now explain. 

We may think of the space 2, as a kind of “infinite-dimensional manifold”. 
This is to be understood in a heuristic sense and we use these terms here to 
emphasize an analogy. Of course, the space (2, is not a manifold in the strict 
sense of the word. To begin with it is not embedded in any finite-dimensional 
Euclidean space. However, it has many features in common with manifolds. The 
first is that we can speak of smooth curves in 2,4. Of course {2,4 is itself a space 
of curves in M. Thus a smooth curve in Qyg would then be a curve of curves, 
namly a map R > 92,,:5+> y, that assigns to each real number s a smooth 
curve y; : 1 > M satisfying y,(a) = p and y,(b) = q. We shall call such a curve 
of curves smooth iff the associated map R x J > M : (s,t) > y;(t) is smooth. 
Thus smooth curves in §2,., are the variations of y with fixed endpoints introduced 
in Definition 4.1.1. 

Having defined what we mean by a smooth curve in (2,,, we can also differenti- 
ate such a curve with respect to s. Here we can simply recall that, since M C R’, 
we have a smooth map R x J — R” and the derivative of the curve s > y; in Qy.4 
can simply be understood as the partial derivative of the map (s,t) > ys(t) with 
respect to s. Thus, in analogy with embedded manifolds, we define the tangent 
space of the space of curves (2, at y as the set of all derivatives of smooth 
curves R > 92,4: 5 +> ys passing through y, i.e. 


¥s|R > Qyq 2S + ys is smooth and yo = y>. 


s=0 


) 
Ty Qpq°= tas 


Let us denote such a partial derivative by X(t) := a sos (t) € T,)M. Thus we 
obtain a smooth vector field along y. Since y,(a) = p and y,(b) = q for all s, this 
vector field must vanish at t = a, b. This suggests the formula 


Ty Qpq = {X € Vect(y) | X(a) = 0, X(b) = 0}. (4.1.6) 


That every tangent vector of the path space (2,,, at y is a vector field along y van- 
ishing at the endpoints follows from the above discussion. The converse inclusion 
is the content of the next lemma. 


Lemma 4.1.7 Let p,q eM, y € Qyq, and X € Vect(y) with X(a) = 0 and 
X(b) = 0. Then there exists a smooth map R > S2pq 1 8 +> ys such that 


yo(t) = y(t), ys(t) = X(t) forallt eT. (4.1.7) 
=0 


s= 


a 
Os 
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Proof The proof has two steps. 


Step 1 There exists snooth map M x I > R": (r,t) & Y;,(r) with compact sup- 
port such that Y,(r) € T,M for allt € I andr € M and Y,(r) = Y,(r) = 0 for 
allr e M. 


Define Z;(r) := I(r) X(t) for t € J and r € M. Choose an open set U C R" 
such that y(1) C U and UNM is compact (e.g. take U := ,<,<;, Be(y(t)) 
for ¢ > 0 sufficiently small). Now let 6 : R” — [0, 1] be a smooth cutoff function 
with support in the unit ball such that 6(0) = 1 and define the vector fields Y, 
by Y,(r) := Ble (r — y(t)))Z,(r) fort € J andre M. 


Step 2 We prove the lemma. 


The vector field Y, : M — TM in Step 1| is complete for each t. Thus there ex- 
ists a unique smooth map R x I > M : (s,t) + y,;(t) such that, for each ¢t € T, 
the curve R > M :s ++ y,(t) is the unique solution of the differential equation 
Ay, (t) = Y,(y;(t)) with yo(t) = y(t). These maps y, satisfies (4.1.7) by Step 1.0 


We can now define the derivative of the energy functional F at y in the direc- 
tion of a tangent vector X € T,.Q,, by 


d 


E(ys), (4.1.8) 
=0 


Ss 


where s +> ys is as in Lemma 4.1.7. Similarly, the derivative of the length func- 
tional L at y in the direction of X € 7, Qpq is defined by 


d 
dL(y)X := — 
(vy) cP 


L(ys). (4.1.9) 
s=0 
To define (4.1.8) and (4.1.9) the functions s +> E(y,) and s+» L(y,) must be 
differentiable at s = 0. This is true for E but it only holds for L when y(t) 4 0 for 
all t € J. Second, we must show that the right hand sides of (4.1.8) and (4.1.9) 
depend only on X and not on the choice of {y,};eR. Third, we must verify 
that dE(y) : T, 82,4 — Rand dL(y) : 2,4 — R are linear maps. This is an exer- 
cise in first year analysis (see also the proof of Theorem 4.1.4). A curve y € 2y4 
is then an extremal point of E (respectively L when y(t) 4 0 for all ¢) if and only 
if dE(y) = 0 (respectively dL(y) = 0). Such a curve is also called a critical point 
of E (respectively L). 


4.1.3 Characterization of Geodesics 


Proof of Theorem 4.1.4 The equivalence of (iii) and (iv) follows directly from the 
equations Vy(t) = I7(y(t))}(t) and ker(I7(y(t))) = TyyM+. 
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We prove that (i) is equivalent to (iii) and (iv). Let X € 7,2) and choose 
a smooth curve of curves R > Q2Qn4 : 5 +> ys satisfying (4.1.7). Then the func- 
tion (s,t) + |¥s(t)|° is smooth and hence 


d 
dE(y)X = ae E(ys) 
S]s—0 
d 1 ; 
=—| —]ly,@°d 
S| _3 {eon ae 


lys(t)|? dt 
0 


b 
= [{r, | jt) 
s s=0 


a 


b 


- [eo.xepat 


a 


(4.1.10) 


b 


= / (0), X() dt. 


a 


That (iii) implies (i) follows directly from this identity. To prove that (i) im- 
plies (iv) we argue indirectly and assume that there exists a point fy € [0, 1] such 
that )(¢o) is not orthogonal to T,,(,,)M . Then there exists a vector v9 € Ty) M such 
that (? (to), vo) > 0. We may assume without loss of generality that a < f% <b. 
Then there exists a constant e > 0 such that a < tj) —e < t +e < band 


tp -—E<t<tte => (V(t), T(y(t))vo) > 0. 


Now choose a smooth cutoff function 6 : J — [0, 1] such that 6(t) = Oforallt € J 
with |t — fo| => ¢ and B(t) = 1. Define X € T,Qpq by 


X(t) := B(t) TT (y(t) v0 fort € I. 
Then (y(t), X(¢)) = 0 for all ¢ and (j(to), X(to)) > 0. Hence 


b 
dé()x = - [ (F).X@) at <0 


a 


and so y does not satisfy (i). Thus (i) is equivalent to (iii) and (iv). 
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We prove that (i) is equivalent to (ii). Assume first that y satisfies (i). Then y 
also satisfies (iv) and hence y(t) L 7,@)M for all t € J. This implies 


ld 
0 = (7). 90) = 5 IPO! 


Hence the function J > Ri: th I(t)? is constant. Choose c > 0 such that 
ly(t)| = c. Ifc = 0, then y(f) is constant and so y(t) = p = q. Ifc > 0, then 


b 

i bis(t)| at 

s=08 
0 . 

/ | _laolae 
; 0 

= frortvo. 5 

b 


1 . : 
~ [ eo. Xeohat 


a 


dL(y)X 


d 
ds 
b 


D o) dt 
=0 


Ss 


= 1 RW)X. 
c 


Thus, in the case c > 0, y is an extremal point of E if and only if it is an extremal 
point of L. Hence (i) is equivalent to (ii). 

We prove that (iii) is equivalent to (v). Let (®, y, y’) be a development of 
along M’ = R”. Then 


d 
YO=POVO,  ZFeOXO = POVXO 


for all X € Vect(y) and all t € J. Take X¥ = y to obtain j’(t) = O(t)Vp(t) for 
allt € J. Thus Vy = Oif and only if y’ = 0. This proves Theorem 4.1.4. Oo 


Remark 4.1.3 shows that reparametrization by a nondecreasing surjective 
mapo: /' + I gives rise to map 


QyqgT) > Qogl'): yr you 
which preserves the length functional, i.e. 


L(yoo)=L(y) 
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for all y € §2,,4(/). Thus the chain rule in infinite dimensions should assert that 
if y oo is an extremal (i.e. critical) point of L, then y is an extremal point of L. 
Moreover, if o is a diffeomorphism, the map y + y oa is bijective and should give 
rise to a bijective correspondence between the extremal points of L on 92,,,(7) and 
those on (2,,,(Z'). Finally, if the tangent vector field y vanishes nowhere, then y 
can be parametrized by the arclength. This is spelled out in more detail in the next 
exercise. 


Exercise 4.1.8 Let y : J = [a,b] — M bea smooth curve such that 


y(t) #0 


for all t € J and define 
b 
T= L(y) = f\yolat. 


(i) Prove that there exists a unique diffeomorphism o : [0,7] > J such that 
t 
or)=t >t = | olds 


for all t’ € [0, 7] and all ¢t € [a,b]. Prove that y’:= yoo: [0,7] > M is 
parametrized by the arclength, i.e. |y’(t')| = 1 for all ¢’ € [0, T]. 
(ii) Prove that 


b 
dL(y)X = - / WO,.xXO)d, VO=hOCRO. G1ay 


d 


Hint: See the relevant formula in the proof of Theorem 4.1.4. 

(iii) Prove that y is an extremal point of L if and only if the curve y’ in part (i) is a 
geodesic. 

(iv) Prove that y is an extremal point of L if and only if there exists a geodesic y’ : 
I' > M anda diffeomorphism o : I’ > I such that y’ = yoo. 


Next we generalize this exercise to cover the case where y is allowed to vanish. 
Recall from Remark 4.1.6 that the function s + L(y;) need not be differentiable. 
As an example consider the case where y = yo is constant (see also Exercise 4.4.12 
below). 
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Exercise 4.1.9 Let y : J ~ M beasmooth curve and let X € T,{2)4(/). Choose 
a smooth curve of curves R > Q2)4(1) : 5 +> ys that satisfies (4.1.7). Prove that 
the one-sided derivatives of the function s +> L(y;) exist at s = 0 and satisfy the 
inequalities 


: d ; 
- [Xela = 7s EW) - = [\xol at 


I v6 


Exercise 4.1.10 Let (&, y, y’) be a development of M along M’. Show that y is a 
geodesic in M if and only if y’ is a geodesic in M’. 


4.2 Distance 


Assume that M Cc R"” is a connected smooth m-dimensional submanifold. Two 
points p,q € M are of distance | p — q| apart in the ambient Euclidean space R”. 
In this section we define a distance function which is more intimately tied to M 
by minimizing the length functional over the space of curves in M with fixed end- 
points. Thus it may happen that two points in M have a very short distance in IR” 
but can only be joined by very long curves in M (see Fig. 4.1). This leads to the 
intrinsic distance in M. Throughout we denote by J = [0, 1] the unit interval and, 
for p,q € M, by 


Qyq i= {y : [0,1] > M |y is smooth and y(0) = p, y(1) = g} (4.2.1) 


the space of smooth paths on the unit interval joining p to g. Since M is connected 
the set £2,, is nonempty for all p,g € M. (Prove this!) 


Definition 4.2.1 The intrinsic distance between two points p,g € M is the real 
number d(p,q) = 0 defined by 


d(p,q):= Oo L(y). (4.2.2) 


The inequality d(p,q) = 0 holds because each curve has nonnegative length and 
the inequality d(p,q) < 0 holds because 2, 4 9. 


Fig. 4.1 Curves in M 
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Remark 4.2.2 Every smooth curve y : [0,1] ~ R” with endpoints y(0) = p 
and y(1) = q satisfies the inequality 


1 1 
L(y) = float = [iwa = |p—4l. 
0 0 


Thus d(p, q) => |p — q|. For y(t) := p + t(q — p) we have equality and hence the 
straight lines minimize the length among all curves from p to q. 


Lemma 4.2.3. The function d : M x M -— [0, 00) defines a metric on M: 


Gi) Ifp,q €M satisfy d(p,q) = 0, then p = q. 
(ii) Forall p,q € M we have d(p,q) = d(q, P). 
(iii) For all p,q,r € M we have d(p,r) < d(p,q) + d(q,r). 


Proof By Remark 4.2.2 we have d(p,q) => |p—4q| for all p,q € M and this 
proves part (i). Part (ii) follows from the fact that the curve y(t) := y(1—1f) 
has the same length as y and belongs to 92,,, whenever y € 92,,,. To prove 
part (iii) fix a constant ¢ > 0 and choose curves yp € 82,4 and y; € §2,, such 
that L(yo) < d(p,q) + ¢ and L(y) < d(q,r) + €. By Remark 4.1.3 we may as- 
sume without loss of generality that yo(1 — t) = y,(t) = q for ¢ > 0 sufficiently 
small. Under this assumption the curve 


(t) := yo(2t), for0 <t < 1/2, 
Wwe Vy (2t—1), forl/2<t<1 
is smooth. Moreover, y(0) = p and y(1) =r andsoy € (ome Thus 


d(p,r) < L(y) = L(vwo) + L(Y) < d(p, 9g) + d(q,r) + 2. 


Hence d(p,r) < d(p,q) + d(q,r) + 2¢ for every ¢ > 0. This proves part (iii) and 
Lemma 4.2.3. Oo 


Remark 4.2.4 It is natural to ask if the infimum in (4.2.2) is always attained. This 
is easily seen not to be the case in general. For example, let M result from the 
Euclidean space R” by removing a point po. Then the distance d(p,q) = |p —q| 
is equal to the length of the line segment from p to q and any other curve from 
p to q is longer. Hence if po is in the interior of this line segment, the infimum 
is not attained. We shall prove below that the infimum is attained whenever M is 
complete. 


Example 4.2.5 Let 


M := S? = {peR? | |p| = 1} 
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Fig. 4.2 A geodesic on the 
2-sphere 


be the unit sphere in IR? and fix two points p,q € S*. Then d(p, q) is the length of 
the shortest curve on the 2-sphere connecting p and g. Such a curve is a segment 
on a great circle through p and q (see Fig. 4.2) and its length is 


d(p,q) = cos '((p,q)), (4.2.3) 
where (p,q) denotes the standard inner product, and we have 
O<d(p,q) <7. 


(See Example 4.3.11 below for details.) By Lemma 4.2.3 this defines a metric on S?. 
Exercise: Prove directly that (4.2.3) is a distance function on S?. 


We now have two topologies on our manifold WM Cc R"”, namely the topology 
determined by the metric d in Lemma 4.2.3 and the relative topology inherited 
from R”. The latter is also determined by a distance function, namely the extrinsic 
distance function defined as the restriction of the Euclidean distance function on R” 
to the subset 7. We denote it by 


dy: M x M -— (0,00), do(p,q) ‘= |p — 4. (4.2.4) 


A natural question is if these two metrics d and dp induce the same topology on M. 
In other words is a subset U C M open with respect to do if and only if it is open 
with respect to d? Or, equivalently, does a sequence p, € M converge to po € M 
with respect to d if and only if it converges to po with respect to dy? Lemma 4.2.7 
answers this question in the affirmative. 


Exercise 4.2.6 Prove that every translation of IR” and every orthogonal transfor- 
mation preserves the lengths of curves. 


Lemma 4.2.7 For every po € M we have 


d(P.q) _ 
pg Po |p — | 


Lemma 4.2.8 Let po € M and let $y : Up — 920 be a coordinate chart onto an 
open subset of R such that its derivative dbo(po) : Tp, M — R"” is an orthogonal 
transformation. Then 


i d(p.q) 
WM A... 
pa Po |do(p) — oo(q)| 
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The proofs will be given below. The lemmas imply that the topology M inherits 
as a subset of R”, the topology on M determined by the metric d, and the topology 
on M induced by the local coordinate systems on M are all the same. 


Corollary 4.2.9 For every subset U C M the following are equivalent. 


(i) U is open with respect to the metric d in (4.2.2). 

(ii) U is open with respect to the metric dg in (4.2.4). 

(iii) For every coordinate chart $9 : Uj) > 829 of M onto an open subset 2y9 C R™ 
the set dg(Up NU) is an open subset of R”. 


Proof By Remark 4.2.2 we have 


Ip —q| < d(p.q) (4.2.5) 


for all p,g € M. Thus the identity idy : (M,d) — (M, do) is Lipschitz continuous 
with Lipschitz constant one and so every do-open subset of M is d-open. Con- 
versely, let U C M be a d-open subset of M and let po € U and ¢ > 0. Then, by 
Lemma 4.2.7, there exists a constant 6 > 0 such that all p,q € M with |p — po| <6 
and |q — po| <6 satisfy 


d(p.qy<U+e)|lp—dl. 
Since U is d-open, there exists a constant p > 0 such that 
B,(Po, d) CU. 


With 


p 
"1l+e 
this implies B,, (Po, do) C U. Namely, if p € M satisfies 


Po:= min}3 


IP — pol < po < 4, 


then 


d(p, Po) < 1+ €)|p — pol < (1+ €)p0 < p 


and so p € U. Thus U is do-open and this proves that (i) is equivalent to (ii). 

That (ii) implies (iii) follows from the fact that each coordinate chart go is a 
homeomorphism. To prove that (iii) implies (i), we argue indirectly and assume 
that U is not d-open. Then there exists a sequence p, € M \ U that converges to 
an element po € U. Let ¢o : Up > (2 be a coordinate chart with po € Up. Then 
lim, |ho(Pv) — Go(Po)| = 0 by Lemma 4.2.8. Thus ¢o(Uo M U) is not open and 
so U does not satisfy (iii). This proves Corollary 4.2.9. O 
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Fig. 4.3 Locally, M is the i mt 
graph of f ° 
ae 


Proof of Lemma 4.2.7 By Remark 4.2.2 the estimate | p — q| < d(p, q) holds for 
all p,q € M. The lemma asserts that, for all po € M and all e > 0, there exists a 
do-open neighborhood Up C M of po such that all p,q € Up satisfy 


Ip—q| <d(p.q) < +8)|p— ql. (4.2.6) 


Let po € M and e > 0, and define x : R” > T,,M and y : R” > T M+ by 


X(p) := IT(po)(p — Po). y(p) := (1 — IT (po))(P — Po), 


where IT(po) : R" — T,,M denotes the orthogonal projection as usual. Then the 
derivative of the map x|y:M — T,, M at p= po is the identity on 7,,, M. Hence the 
Inverse Function Theorem 2.2.17 asserts that the map x|y : M — T,,M is locally 
invertible near po. Extending this inverse to a smooth map from T,, M to R” and 
composing it with the map y: M — T,,M +, we obtain a smooth map 


f i T)M > Tp,M~ 
and an open neighborhood W C R" of po such that 
peM = y(p) = f(x(p)) 
for all p € W (see Fig. 4.3). Moreover, by definition the map f satisfies 
f(0)=0€T,M*, df(0)=0:T,,M > Tp,M-. 
Hence there exists a constant 5 > 0 such that, for every x € TM, we have 


df (xy 
Ix}<5 => x4 f(x) € W and |\df(x)|| = — sup 12) og, 
O£X€Tp, M [x| 


Define 
Up := {p EM NW | |x(p)| < 4}. 


Given p,q € Up let y : [0,1] > M be the curve whose projection to the x-axis is 
the straight line joining x(p) to x(q), ie. 


x(y(t)) = x(p) + t(x(q) — x(p)) =: x0), 
V(t) = XY) = F&M) = yO. 
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Then y(t) € Up for all t € [0, 1] and 


t 


L(y) = | ikXO+ y@ldt 
| 


= i et) + df(a)x()| dt 
0 


< ; (1+ lldf(x@)ll) KO at 


0 
<(U+ 2) [lola 
0 


= (1 + €)|x(p) — x(q)| 
= (1+ €)|JT(po)(p — 9) 
= (lL e)|p=Gl. 


Hence d(p,q) < L(y) < (+ )|p —4q| and this proves Lemma 4.2.7. Oo 
Proof of Lemma 4.2.8 By assumption we have 


|dgo(po)v| = |v| 


for all v € T,,M. Fix a constant e > 0. Then, by continuity of the derivative, 
there exists a dp-open neighborhood My C M of po such that for all p € Mo and 
all v € T,M we have 


(1 — €)|dgo(p)v| < |v] S$ 1 + 8)|dgo(p)uI. 


Thus for every curve y : [0, 1] > Mo we have 


(—e)Lifooy)) s L(Y) s + )L(po°y). 


One is tempted to take the infimum over all curves y : [0,1] — Mp joining two 
pints p,q € Mo to obtain the inequality 


(1 — €)|do(p) — g0(9)| < d(p.q) < (1 + €)|¢0(p) — d0(Q)I- (4.2.7) 


However, we must justify these inequalities by showing that the infimum over all 
curves in Mo agrees with the infimum over all curves in M joining the points p 
and q. 

It suffices to show that the inequalities hold on a smaller heighborhood M; C Mo 
of po. Choose such a smaller neighborhood M, such that the open set do9(M)) is a 
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convex subset of §29. Then the right inequality in (4.2.7) follows by taking the curve 
y: [0,1] ~ M, from y(0) = p to y(1) = q such that ¢o o y : [0, 1] > do(M1) isa 
straight line. To prove the left inequality in (4.2.7) we use the fact that Mo is d-open 
by Lemma 4.2.7. Hence, after shrinking M, if necessary, there exists a constant 
r > O such that 


Po © MC B,(po,d) C Bs-(po,d) C Mo. 


Then, for p,g € M, we have d(p,q) < 2r while L(y) => 4r for any curve y from p 
to q which leaves Mo. Hence the distance d(p,q) of p,q € M, is the infimum of 
the lengths L(y) over all curves y : [0,1] > Mp that join y(0) = p to y(1) = q. 
This proves the left inequality in (4.2.7) and Lemma 4.2.8. Oo 


A next question one might ask is the following. Can we choose a coordinate 
chart @ : U — §20nM with values in an open set S2 C R” so that the length of each 
smooth curve y : [0, 1] + U is equal to the length of the curve c := doy: [0,1] > 
Q? We examine this question by considering the inverse map y := @! : Q > U. 
Denote the components of x and w(x) by 


8 =O ons B™) EQ, w(x) = (Wl(x),...,W"(x)) € U. 


Given a smooth curve [0, 1] > 2:tft> c(t) = (cl(t),...,¢’(t)) we can write the 
length of the composition y = y oc: [0,1] > M in the form 


1 
d 
L(y oe) = [pve dt 
0 


n 


d 2 
=} » (Grew) dt 


0 \v=l 


1 n m p 2 
=f alps eee] dt 
\ = x 


=| Dons a (ott) leper (ner (n dt 


1 
=f | Lea cwne wate. 
0 Niv=l 

Here the functions g;; : 82 — R are defined by 


"aw ow” 0 0 
0) => ww = ( s(t se co}, (4.2.8) 


v=1 
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Fig. 4.4 A spherical triangle 


Thus we have a smooth function g = (g;;) : 82 > R”*” with values in the positive 
definite matrices given by g(x) = dw(x)'dw(x) such that 


1 


Li oc) = / Vé(t)'g(c(t))é(t) dt (4.2.9) 


0 


for every smooth curve c : [0,1] — &. Thus the condition L(w oc) = L(c) for 
every such curve is equivalent to 


ij (Xx) = 5; 
for all x € 92 or, equivalently, 
dw (x)'dw(x) = 1. (4.2.10) 


This means that wy preserves angles and areas. The next example shows that for 
M = S? it is impossible to find such coordinates. 


Example 4.2.10 Consider the manifold M = S?. If there is a diffeomorphism w : 
92 — U from an open set 2 C R? onto an open set U C S? that satisfies (4.2.10), it 
has to map straight lines onto arcs of great circles and it preserves the area. However, 
the area A of a spherical triangle bounded by three arcs on great circles satisfies the 
angle sum formula 


at+P+y=an2+A. 


(See Fig. 4.4.) Hence there can be no such map yy. 


4.3 The Exponential Map 


Geodesics give rise to a flow on the tangent bundle, the geodesic flow. Itis generated 
by a vector field on the tangent bundle, called the geodesic spray. The time-1-map 
of the geodesic flow gives rise to the exponential map. 


4.3.1 Geodesic Spray 


The tangent bundle 7M is a smooth 2m-dimensional manifold in R” x R” by 
Corollary 2.6.12. The next lemma characterizes the tangent bundle of the tangent 
bundle. Compare this with Lemma 3.4.5. 
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Lemma 4.3.1 The tangent space of TM at (p,v) € TM is given by 


Pp €T,M and 


ns =I. St (4.3.1) 


Tp») TM = a0 eR" x R" 


Proof We prove the inclusion “C” in (4.3.1). Let (D,¥) € T(p,»)TM and choose a 
smooth curve R > TM :t + (y(t), X(t)) such that 


yO=p, XO=% yO=7 *O=v 


Then X = VX + h,(y, X) by the GauB-Weingarten formula (3.2.2) and hence 
(1 — M(y(t)))X(t) = hyn (v(t), X(t) for all ¢ € R. Take ¢ = 0 to obtain 
(1 — 1(p))v = h,(p, v). This proves the inclusion “C” in (4.3.1). Equality holds 
because both sides of the equation are 2m-dimensional linear subspaces of R” x R”. 

Oo 


By Lemma 4.3.1 a smooth map S = (S$), $2): TM — R"” x R" is a vector field 
on 7M if and only if 


Si(p,v)€T,M, — (1-H (p))So(p, v) = hp (Si(p. v), v) 


for all (p, v) € TM. A special case is where S;(p, v) = v. Such vector fields cor- 
respond to second order differential equations on M. 


Definition 4.3.2 (Spray) A vector field S € Vect(7M) is called a spray iff it has 
the form S(p, v) = (v, S2(p, v)) where Sy : TM — R" is asmooth map satisfying 


(1 — (p))S2(p, v) = Ap(v, v), So(p, Av) = A? S2(p, v) (4.3.2) 
for all (p,v) € TM and A € R. The vector field S € Vect(7M) defined by 
S(p,v):= (v,hy(v,v))  € Toy »TM (4.3.3) 


for p € M and v € T,M is called the geodesic spray. 


4.3.2 The Exponential Map 


Lemma 4.3.3. Let y : 1 — M beasmooth curve on an open interval I C R. Then 
y is a geodesic if and only if the curve I > TM :t ® (y(t), y(t)) is an integral 
curve of the geodesic spray S in (4.3.3). 


Proof Asmoothcurve 1 — TM :tt> (y(t), X(¢)) is an integral curve of S if and 
only if y(t) = X(t) and X (t) = hy (X(t), X()) for all t € J. By equation (4.1.5), 
this holds if and only if y is a geodesic and y = X. Oo 


Combining Lemma 4.3.3 with Theorem 2.4.7 we obtain the following existence 
and uniqueness result for geodesics. 
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Lemma 4.3.4 Let M C R" be an m-dimensional submanifold. 


(i) For every p € M and every v € T,M there is an € > 0 and a smooth curve 
y : (—e,€) > M such that 


Vy =0, y(0) = p, y(0) =v. (4.3.4) 
Gi) fy: Ll) > M and yz: In > M are geodesics and to € I, N Ip with 
Vito) = Y2(to), Vi (to) = 2(to), 
then y(t) = y2(t) for allt € 1, N In. 


Proof Lemma 4.3.3 and Theorem 2.4.7. Oo 


Definition 4.3.5 (Exponential map) For p ¢ M and v € T,,M the interval 


T is an open interval containing 0 and there is a 


ICR . ae : . 
geodesic y : I > M satisfying y(0) = p, y(0) = v 


Tp t= 'o 


is called the maximal existence interval for the geodesic through p in the direction 
v. For p € M define the set V, C T,M by 


V,:= {ve T,M|1e 1}. (4.3.5) 
The exponential map at p is the map exp, : V, > M that assigns to every tangent 
vector u € V, the point exp, (v) := y(1), where y : J,,, > M is the unique geodesic 
satisfying y(0) = p and y(0) = v. 
Lemma 4.3.6 
(i) The set 


V :={(p,v)|peM,veV,}CTM 


is open and the map V — M : (p,v) +> exp, (v) is smooth. 
Gi) Ifp € M and v € V,, then 


Ip» = {t ER|tv e V,} 
and the geodesic y : In, — M with y(0) = p and y(0) = v is given by 


y(t) = exp, (tv), telny. 
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Fig. 4.5 The exponential 
map 


Proof Part (i) follows directly from Lemma 4.3.3 and Theorem 2.4.9. To prove 


part (ii), fix an element p € M and a tangent vector v € V,, and let y: J,, > M 


be the unique geodesic with y(0) = p and y(0) = v. Fix a nonzero real number A 
and define the map y, : A~!J,,, > M by 


Pw 
y(t) = y(At) fort € pha here 
Then y(t) = Ap(At) and ¥,(t) = A?P (At) and hence 
Vad) = Waar =VA(yAt))yAt) = VVyAt) = 0 
for everyt €A'T p.v- This shows that y, is a geodesic with 


yn(0) = p, y,(0) = Av. 


In particular, we have A~! J p.v © Ip,y. Interchanging the roles of v and Av we obtain 
ATT = Lpjigv THUS 


AE Thy = Le Thay = Ave V, 
and 
y(A) = ya) = exp, (Av) 
for A € Ip. This proves Lemma 4.3.6. Oo 


Since exp, (0) = p by definition, the derivative of the exponential map at v = 0 
is a linear map from 7; M to itself. This derivative is the identity map as illustrated 
in Fig. 4.5 and proved in the following corollary. 


Corollary 4.3.7 The map exp, : V, > M is smooth and its derivative at the origin 
is d exp, (0) = id: T,M > T,M. 


Proof The set V, is an open subset of the linear subspace T, M C R", with respect 
to the relative topology, and hence is a manifold. The tangent space of V, at each 
point is 7, M. By Lemma 4.3.6 the exponential map exp, : V, > M is smooth and 
its derivative at the origin is given by 


d : 
d exp, (0)u = ai - exp, (tv) = y(0) = v, 


where y : I), — M is once again the unique geodesic through p in the direction v. 
This proves Corollary 4.3.7. O 
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Corollary 4.3.8 Let p € M and, forr > 0, denote 
B,(p) := {v € T)M | |v| <r}. 
Ifr > 0 is sufficiently small, then B,(p) C Vp, the set 
U;(p) := exp,(B;(p)) 


is an open subset of M, and the restriction of the exponential map to B,(p) is a 
diffeomorphism from B,(p) to U,(p). 


Proof This follows directly from Corollary 4.3.7 and Theorem 2.2.17. oO 


Definition 4.3.9 (Injectivity radius) Let M Cc R” be a smooth m-manifold. The 
injectivity radius of M at p is the supremum of all real numbers r > 0 such 
that B,(p) C V, and the restriction of the exponential map exp, to B,(p) is a dif- 
feomorphism onto its image 


U,(p) := exp, (B,(p)). 


It will be denoted by 


B,(p) C V, and 
inj(p) := inj(p; M) := supyr > 0) exp, : B-(p) > U,(p) 
is a diffeomorphism 


The injectivity radius of M is the infimum of the injectivity radii of M at p over 
all p € M. It will be denoted by 


inj(M) := au inj(p; M). 


4.3.3 Examples and Exercises 
Example 4.3.10 The exponential map on R” is given by 
exp,(v) = pt+u for p,v € R”. 


For every p € R” this map is a diffeomorphism from 7,]R” = R” to R” and hence 
the injectivity radius of R” is infinity. 
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Example 4.3.11 The exponential map on S$” is given by 


sin(|v|) 


lv 


exp,(v) = cos(|v|) p + v 


for every p € S” and every nonzero tangent vector v € 7,S” = p-. The restric- 
tion of this map to the open ball of radius r in TM is a diffeomorphism onto its 
image if and only if r < z. Hence the injectivity radius of S” at every point is z. 

Exercise: Given p € S” and 0 4 v € 7,8" = p+, prove that the geodesic 
y:R— S” with y(0) = p and y(0) = v is given by 


sin(t|v|) 


y(t) = cos(t|v|)p + rol 


for t € R. Show that in the case 0 < |u| < z there is no shorter curve in S$” con- 
necting p and q := y(1) and deduce that the intrinsic distance on S” is given by 


d(p.q) = cos”'((p.4)) 
for p,q € S” (see Example 4.2.5 for m = 2). 
Example 4.3.12 Consider the orthogonal group O(7) C R”*” with the stan- 


dard inner product (v,w) := trace(v'w) on R"*”". The orthogonal projection 
IT(g): R"" > T, O(n) is given by 


IT(g)v := 50 —gv'g) 
and the second fundamental form by 
hg(v,v) = —gou'v. 
Hence a curve y : R + O(n) is a geodesic if and only if y'¥ + ~'y = 0 or, equiv- 


alently, y'y is constant. This shows that geodesics in O(7) have the form y(t) = 
gexp(t&) for g € O(m) and & € o(). It follows that 


exp,(v) = g exp(g'v) = exp(ug')g 


for g € O(n) and v € T,O(n). In particular, for g = 1 the exponential map 
exp, : 0(”) — O(n) agrees with the exponential matrix. 


Exercise 4.3.13 What is the injectivity radius of the 2-torus T* = S! x S!, the 
punctured 2-plane R? \ {(0, 0)}, and the orthogonal group O(n)? 
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4.3.4 Geodesics in Local Coordinates 


Lemma 4.3.14 Let M C R" be an m-dimensional manifold and choose a coordi- 
nate chart ¢ : U — QQ with inverse 


wi=G@': QU. 


Let rk : 2 — R be the Christoffel symbols defined by (3.6.6) and letc: I > 2 
be a smooth curve. Then the curve 


yi=ypoc:Il>~M 
is a geodesic if and only if c satisfies the 2nd order differential equation 
e+ \° r@e'e! =0 (4.3.6) 
ij=l 


fork =1,...,m. 


Proof This follows immediately from the definition of geodesics and equa- 
tion (3.6.7) in Lemma 3.6.1 with X¥ = yp and& = ¢. Oo 


We remark that Lemma 4.3.14 gives rise to another proof of Lemma 4.3.4 that 
is based on the existence and uniqueness of solutions of second order differential 
equations in local coordinates. 


Exercise 4.3.15 Let 2 CR” be an open set and g = (g;;): 2 > R”*” be asmooth 
map with values in the space of positive definite symmetric matrices. Consider the 
energy functional 


1 
E(c):= [ Heo.co dt 
0 
on the space of paths c : [0,1] ~ @, where L : Q x R” —> R is defined by 


m 


L(x, €) := ; >> Fay (ae. (4.3.7) 


ij=l 
The Euler-Lagrange equations of this variational problem have the form 


Serle.) = 2 (c(t). (0) k=1,...,m. (4.3.8) 


Prove that the Euler-Lagrange equations (4.3.8) are equivalent to the geodesic equa- 
tions (4.3.6), where the Dy : Q — R are given by (3.6.10). 
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4.4 Minimal Geodesics 


Any straight line segment in Euclidean space is the shortest curve joining its end- 
points. The analogous assertion for geodesics in a manifold M is false; consider for 
example an arc which is more than half of a great circle on a sphere. In this section 
we consider curves which realize the shortest distance between their endpoints. 


4.4.1. Characterization of Minimal Geodesics 


Lemma 4.4.1 Let I = [a,b] be a compact interval, let y : I — M be a smooth 
curve, and define p := y(a) and q := y(b). Then the following are equivalent. 


(i) y is parametrized proportional to the arclength, i.e. |y(t)| = c is constant, 
and y minimizes the length, i.e. L(y) < L(y’) for every smooth curve y' in M 
Joining p and q. 

(ii) y minimizes the energy, i.e. E(y) < E(y’) for every smooth curve y’ in M 
joining p and q. 


Definition 4.4.2 (Minimal geodesic) A smooth curve y : J — M ona compact 
interval J C R is called a minimal geodesic iff it satisfies the equivalent conditions 
of Lemma 4.4.1. 


Remark 4.4.3 


(i) Condition (i) says that (the velocity || is constant and) L(y) = d(p,q), i.e. 
that y is a shortest curve from p to q. It is not precluded that there be more 
than one such y; consider for example the case where M is a sphere and p 
and q are antipodal. 

(ii) Condition (ii) implies that 


d 
—| E(y,) =0 
Te (ys) 


for every smooth variation R x J > M:s +> y,(t) of y with fixed endpoints. 
Hence a minimal geodesic is a geodesic. 

(iii) Finally, we remark that L(y) (but not E'(y)) is independent of the parametriza- 
tion of y. Hence, if y is a minimal geodesic, then L(y) < L(y’) for every y’ 
(from p to qg) whereas E(y) < E(y’) for those y’ defined on (an interval the 
same length as) J. 


Proof of Lemma 4.4.1 We prove that (i) implies (11). Let c be the (constant) value 
of | (t)|. Then 


b-— 2 
Ly) =(-aje, Ey) = tas 
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Then, for every smooth curve y’: J > M with y’(a) = p and y'(b) = q, we have 


4E(y) = 7? L(y)? 
<7L(y'? 
2 


b 
= ¢? / Wl at 
b 


<eb—a) fOPar 


= 2(b—a)c?E(y’) 
= 4E(y)E(y’). 


Here the fourth step follows from the Cauchy—Schwarz inequality. Now divide 
by 4E(y) to obtain E(y) < E(y’). 

We prove that (ii) implies (i). We have already shown in Remark 4.4.3 that (ii) 
implies that y is a geodesic. It is easy to dispose of the case where M is one- 
dimensional. In that case any y minimizing E(y) or L(y) must be monotonic onto 
a subarc; otherwise it could be altered so as to make the integral smaller. Hence sup- 
pose M is of dimension at least two. Suppose, by contradiction, that L(y’) < L(y) 
for some curve y’ from p to q. Since the dimension of M is bigger than one, 
we may approximate y’ by a curve whose tangent vector nowhere vanishes, i.e. 
we may assume without loss of generality that p’(t) ~ 0 for all t. Then we can 
reparametrize y’ proportional to arclength without changing its length, and by a 
further transformation we can make its domain equal to J. Thus we may assume 
without loss of generality that y’ : J > M is a smooth curve with y’(a) = p and 
y'(b) = q such that |y’(t)| = c’ and 


(b—a)c' = L(y’) < L(y) = (b—-a)c. 
This implies c’ < c and hence 


(b—a)c” (b—a)c? 
= < = 


E(y’) ; 5 


E(y). 


This contradicts (ii) and proves Lemma 4.4.1. Oo 


4.4.2 Local Existence of Minimal Geodesics 


The next theorem asserts the existence of minimal geodesics joining two points 
that are sufficiently close to each other. It also shows that the set U,(p) = 
exp,(B,(p)) that was introduced in Definition 4.3.9 is actually the open ball 
U,(p) = {q € M|d(p,q) <r} whenever r < inj(p; M). 
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Fig. 4.6 The Gau8 Lemma Ur 


Theorem 4.4.4 (Existence of minimal geodesics) Let M C R” be a smooth m- 
manifold, fix a point p € M, and let r > 0 be smaller than the injectivity radius of 
M at p. Letv € T,M such that |v| < r. Then 


d(p.q)=|v|], — ¢ := exp,(v), 


and acurve y € Qy 4 has minimal length L(y) = |v| if and only if there is a smooth 
map B : [0,1] — [0, 1] satisfying 


pO =0, BU)=1, p20 
such that y(t) = exp, (B(t)v) for0 <t <1. 
The proof is based on the following lemma. 


Lemma 4.4.5 (Gau8 Lemma) Let M, p, r be as in Theorem 4.4.4, let I C R be 
an open interval, and let w : I — V, be a smooth curve whose norm 


|w(t)| =: 7 
is constant. Define 
a(s,t) := exp, (sw(t)) 


for (s,t) € R x I with sw(t) € Vp. Then 
da da\ _ ; 
as’ otf 


Thus the geodesics through p are orthogonal to the boundaries of the embedded 
balls U,(p) in Corollary 4.3.8 (see Fig. 4.6). 


Proof of Lemma 4.4.5 For every t € I we have 
a(0,t) = exp,(0) = p 


and so the assertion holds for s = 0, i.e. 


dat da 
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Moreover, each curve s +> a(s,f) is a geodesic, 1.e. 


da 07a 
V;— = (a)—~ =0. 
* OS Os? ° 


By Theorem 4.1.4, the function 


da 
st> |—(s,t) 
ds 
is constant for every f, so that 


we s.0) =|w(t)| =r for (s,t)E Rx J. 
s 


da 
= oo 


This implies 


8 (aw aa _ 
ds\ as’ or] 


iu da 
ds Ost 
14 |da\" 


~ 2 ar] ds 
= 0. 


Since the function (2, amy vanishes for s = 0 we obtain 


da oa 
(S00, et] =0 


for all s and t. This proves Lemma 4.4.5. Oo 


Proof of Theorem 4.4.4 Let r > 0 be as in Corollary 4.3.8 and let v € 7,M such 


that 0 < |v| =: e < r. Denote q := exp, (v) and let y € 2,4. Assume first that 


y(t) € exp,(B.(p)) =U. Vere [0,1]. 


Then there is a unique smooth function [0, 1] > T,M :t +> v(t) such that |v(t)| <e 
and y(t) = exp,(u(t)) for every t. The set 


P= {te 0, Nlv© # pt = tt € [0,1] |v@ F 0} c ©, 1] 
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is open in the relative topology of (0, 1]. Thus J is a union of open intervals in (0, 1) 
and one half open interval containing 1. Define f : [0, 1] > [0, 1] andw: 1 > 7,M 
by 


_ lW@| _ vit) 
B(t) := aot w(t) = Teal 


Then £ is continuous, both 6 and w are smooth on J, 
B(O) = 9, BQ) = 1, w(1) =, 
and 
lwl=e, y(t) = exp, (B)w(t)) 


for allt € I. We prove that L(y) > e. To see this let w : [0,1] x J > M be the map 
of Lemma 4.4.5, i.e. 


a(s,t) := exp, (sw(t)). 


Then y(t) = a(A(t), t) and hence 


, 0 0 
V0) = BOF BOD + FBO. 


for every t € J. Hence it follows from Lemma 4.4.5 that 


. (0 > ja a 
Hor = ber Z@O.0] +|E@eO.0) = bore? 


for every t € J. Hence 


1 
L(y) = | ly@ldt = [|y@ldt=e | /BOldt>=e | B)dt =e. 
n= finolar= fivoidre« [polar | soar =- 


Here the last equality follows by applying the fundamental theorem of calculus to 
each interval in J and using the fact that 6(0) = 0 and 6(1) = 1. If L(y) = ¢, we 
must have 


dat ‘ 
5, PON =0, B20 for allt € I. 
Thus / is a single half open interval containing | and on this interval the condition 


de (B(t), t) = 0 implies w(t) = 0. Since w(1) = v we have w(t) = v for every 
t € I. Hence y(t) = exp,(A(t)v) for every ¢ € [0, 1]. It follows that 6 is smooth 
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on the closed interval [0, 1] (and not just on /). Thus we have proved that every 
y € Qyq with values in U, has length L(y) > ¢ with equality if and only if y is 
a reparametrized geodesic. But if y € {2,, does not take values only in U,, there 
must be a T € (0, 1) such that y([0, T]) C U, and y(T) € AU,. Then L(y|,o,7}) = €, 
by what we have just proved, and L(y|;7,1)) > 0 because the restriction of y to [T, 1] 
cannot be constant; so in this case we have L(y) > e. This proves Theorem 4.4.4. 

Oo 


The next corollary gives a partial answer to our problem of finding length mini- 
mizing curves. It asserts that geodesics minimize the length locally. 


Corollary 4.4.6 Let M CR" be a smooth m-manifold, let I C R be an open 
interval, and let y: I — M be a geodesic. Fix a point ty € I. Then there exists a 
constant € > 0 such that 
ly -E<s<t<tt+e6 = L(y|s.c—) = €(v(S), y@))- 
Proof Since y is a geodesic its derivative has constant norm |y(t)| = c (see Theo- 
rem 4.1.4). Choose 6 > 0 so small that the interval [f9 — 6, fg + 4] is contained in J. 
Then there is a constant r > 0 such that r < inj(y(t)) whenever |t — fo| < 6. Choose 
€ > 0 such that 
ée<o6, 2ec <r. 
Iff»-—e<s<t<t%+e, then 
y(t) = exp,s)(¢ — s)y(s)) 
and 
I(t — s)p(s)| = |t —sle < 2ec <r < inj(y(s)). 
Hence it follows from Theorem 4.4.4 that 
L(y|[s) = It — slc = d(y(s), y(@). 


This proves Corollary 4.4.6. O 


4.4.3 Examples and Exercises 


Exercise 4.4.7 How large can the constant ¢ in Corollary 4.4.6 be chosen in the 
case M = S*? Compare this with the injectivity radius. 
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Remark 4.4.8 We conclude from Theorem 4.4.4 that 


S,(p) = {¢ € M|d(p,q) =r} = exp,({v € T,M | |v| = r}) (4.4.1) 


for 0<r <inj(p;M). The Gaui Lemma 4.4.5 shows that the geodesic rays 
[0, 1] + M : s +> exp, (sv) emanating from p are the orthogonal trajectories to the 
concentric spheres S,(p). 


Exercise 4.4.9 Let 
M CR 


be of dimension two and suppose that M is invariant under the (orthogonal) reflec- 
tion about some plane E C R?. Show that EF intersects M in a geodesic. (Hint: 
Otherwise there would be points p,g € M very close to one another joined by two 
distinct minimal geodesics.) Conclude for example that the coordinate planes inter- 
sect the ellipsoid (x/a)* + (y/b)? + (z/c)? = 1 in geodesics. 


Exercise 4.4.10 Choose geodesic normal coordinates near p € M via 


q = exp, (>: vo). 


i=1 


where é),..., € 1S an orthonormal basis of T, M (see Corollary 4.5.4 below). Then 
we have x'(p) = 0 and 


m 


B,(p) = {¢ € M|d(p,q) <r} = I eM | Y|xi@|’ < r| (4.4.2) 
i=1 


for 0 < r < inj(p; M). Hence Theorem 4.5.3 below asserts that B,(p) is convex 
for r > 0 sufficiently small. 


(i) Show that it can happen that a geodesic in B,(p) is not minimal. Hint: Take M 
to be the hemisphere {(x, y,z) € R? |x? + y? +z? = 1, z > 0} together with 
the disc {(x, y,z) € R*|x? + y? < 1, z = 0}, but smooth the corners along 
the circle x? + y? = 1, z = 0. Take p = (0,0, 1) andr = 27/2. 

(ii) Show that, if r > 0 is sufficiently small, then the unique geodesic y in B,(p) 
joining two points qg,g’ € B,(p) is minimal and that in fact any curve y’ 
from g to g’ which is not a reparametrization of y is strictly longer, i.e. 
L(y’) > L(y) = dq, q’). 
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Exercise 4.4.11 Let y: J =[a,b] > M beasmooth curve with endpoints y(a) = p 
and y(b) = q and nowhere vanishing derivative, i.e. y(t) ~ 0 for all t € 7. Prove 
that the following are equivalent. 


(i) The curve y is an extremal of the length functional, i.e. every smooth 
map Rx l > M :(s,t) y,;(t) with y,;(a) = p and y;(b) = q for all s 
satisfies 


d 
mal) = 0. 
qs (ys) 


s=0 


(ii) The curve y is a reparametrized geodesic, i.e. there exists a smooth map o : 
[a,b] — [0,1] with o(a) = 0, o(b) = 1, o(t) = O for all t € J, and a vec- 
tor v € T,M such that 


q = exp,(v), y(t) = exp, (o(¢)v) 


for allt € 7. (We remark that the hypothesis y(t) 4 0 implies that o is actually 
a diffeomorphism, i.e. a(t) > 0 for all ¢ € J.) 

(iii) The curve y minimizes the length functional locally, i.e. there exists an e > 0 
such that L(y|j5,11) = d(y(s), y(¢)) for every closed subinterval [s,t] C J of 
length t —s <e. 


It is often convenient to consider curves y where y(t) is allowed to vanish for 
some values of ft; then y cannot (in general) be parametrized by arclength. Such a 
curve y : I — M can be smooth (as a map) and yet its image may have corners 
(where y necessarily vanishes). Note that a curve with corners can never minimize 
the distance, even locally. 


Exercise 4.4.12 Show that conditions (ii) and (111) in Exercise 4.4.11 are equiv- 
alent, even without the assumption that y is nowhere vanishing. Deduce that, 
if y : 1 > M is a shortest curve joining p to q, i.e. L(y) = d(p,q), then y is a 
reparametrized geodesic. 

Show by example that one can have a variation {ys}seR of a reparametrized 
geodesic yo = y for which the map s +> L(y,) is not even differentiable at s = 0. 
(Hint: Take y to be constant. See also Exercise 4.1.9.) 

Show, however, that conditions (i), (ii) and (iii) in Exercise 4.4.11 remain 
equivalent if the hypothesis that y is nowhere vanishing is weakened to the hy- 
pothesis that y(t) 4 O for all but finitely many t € J. Conclude that a broken 
geodesic is a reparametrized geodesic if and only if it minimizes arclength locally. 
(A broken geodesic is a continuous map y : J = [a,b] — M for which there ex- 
ista = t) <t) <-++- <4, = bsuch that y|p,_, 4.) is a geodesic for? = 1,...,n. Itis 
thus a geodesic if and only if 7 is continuous at the break points, i.e. y(¢7) = y(t;") 
fori = 1,...,n—1.) 
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4.5 Convex Neighborhoods 


A subset of an affine space is called convex iff it contains the line segment joining 
any two of its points. The definition carries over to a submanifold M of Euclidean 
space (or indeed more generally to any manifold M equipped with a spray) once 
we reword the definition so as to confront the difficulty that a geodesic joining two 
points might not exist nor, if it does, need it be unique. 


Definition 4.5.1 (Geodesically convex set) Let M CR” be a smooth m- 
dimensional manifold. A subset U C M is called geodesically convex iff, for 
all po, pi € U, there exists a unique geodesic y : [0,1] — U such that y(0) = po 
and y(1) = pr. 


It is not precluded in Definition 4.5.1 that there be other geodesics from p to q 
which leave and then re-enter U, and these may even be shorter than the geodesic 
inU. 


Exercise 4.5.2 


(a) Find a geodesically convex set U ina manifold M and points po, p; € U such 
that the unique geodesic y : [0,1] ~ U with y(0) = po and y(1) = p; has 
length L(y) > d(po, p1). Hint: An interval of length bigger than z in S!. 

(b) Find a set U in a manifold M such that any two points in U can be joined by 
a minimal geodesic in U, but U is not geodesically convex. Hint: A closed 
hemisphere in $7. 


Theorem 4.5.3 (Convex Neighborhood Theorem) Let M C R” be a smooth m- 
dimensional submanifold and fix a point po € M. Let 6: U > 92 be any coordinate 


chart on an open neighborhood U C M of po with values in an open set 82 CR’. 
Then the set 


U, = tp € U | |b(p) — (Pol < 75 (4.5.1) 
is geodesically convex for r > 0 sufficiently small. 
Before giving the proof of Theorem 4.5.3 we derive a useful corollary. 


Corollary 4.5.4 Let M CR" be a smooth m-manifold and let py € M. Then, 
forr > 0 sufficiently small, the open ball 


U,(po) := {p € M | d(po, Pp) <r} (4.5.2) 


is geodesically convex. 
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Proof Choose an orthonormal basis e1,...,@m Of Tp, M and define 


2 := {x € R™| |x| < inj(po; M)}, 
U := {p € M|d(po, Pp) < inj(po: M)}. 


Define the map w : 22 > U by 


W(x) := exp, (>: ve) (4.5.4) 


i=1 


(4.5.3) 


for x = (x!,...,x’) € 2. Then w is a diffeomorphism and d(po, W(x)) = |x| for 
all x € §2 by Theorem 4.4.4. Hence its inverse 


g@:=w!:U>2 (4.5.5) 
satisfies #(po) = 0 and |f(p)| = d(po, p) for all p € U. Thus 


U,(Po) = {Pp € U||o(p) — $(Po)| <7} for0 <r < inj(po; M) 


and so Corollary 4.5.4 follows from Theorem 4.5.3. Oo 


Definition 4.5.5 (Geodesically normal coordinates) The coordinate chart ¢ : 
U — @ in (4.5.4) and (4.5.5) sends geodesics through pp to straight lines through 
the origin. Its components x!,...,x’" : U > R are called geodesically normal 
coordinates at po. 


Proof of Theorem 4.5.3 Assume without loss of generality that @(po) = 0. Let 
re : 92 — R be the Christoffel symbols of the coordinate chart and, for x € 92, 
define the quadratic function QO, : R” — R by 


(6) = (EY — D> xt rtooe'e/. 


k=1 ijk=l 


Shrinking U, if necessary, we may assume that 


1 
< — forallx € 2. 
2m 


Then, for all x € Q and all € € R” we have 


m 


2 
ett Se) 2 ees 
Ox(E) = (é| z(2 aes 


Hence Q, is positive definite for every x € 92. 
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Now let y : [0,1] ~ U be a geodesic and define 


c(t) = o(y(@)) 


for 0 < t < 1. Then, by Lemma 4.3.14, c satisfies the differential equation 


+ YS rk(eydie =0. 
ij 
Hence 
lel? 


ps — yal? a _ (c ae 
ee gi ere oe 


and so the function t b& lp(y(t)) | is convex. Thus, if y(0), y(1) € U, for some 
r > 0, it follows that y(t) € U, for all t € [0, 1]. 
Consider the exponential map 
V = {(p,v) € TM |v € V,} > M: (p,v) & exp, (v) 
in Lemma 4.3.6. Its domain V is open and the exponential map is smooth. Since 
it sends the pair (po,0) € V to exp,,(0) = po € U, it follows from continuity that 
there exist constants ¢ > 0 andr > 0 such that 
péeu,, veET,M, |r| <e => veV,, exp,(v)eU. (4.5.6) 
Moreover, we have 
d exp,,(0) = id: T,,M — Tp.M 
by Corollary 4.3.7. Hence the Implicit Function Theorem 2.6.15 asserts that the 
constants ¢ > 0 and r > 0 can be chosen such that (4.5.6) holds and there exists a 
smooth map /: U, x U, — R" that satisfies the conditions 
h(p.q) € TpM, lh(p.q)| <€ (4.5.7) 
for all p,q € U, and 
exp,(v) = q — v =h(p,q) (4.5.8) 
for all p,g € U, and all v € T,M with |v| < e. In particular, we have 


h(po, Po) = 0 


and exp, (h(p,q)) = q for all p,q € U,. 
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Fix two constants ¢ > 0 and r > 0 and a smooth map 1: U, x U, — R” such 
that (4.5.6), (4.5.7), (4.5.8) are satisfied. We show that any two points p,q € U, are 
joined by a geodesic in U,. Let p,q € U, and define 


y(t) := exp, (th(p,q)) forO <t <1. 


This curve y : [0,1] > M is well defined by (4.5.6) and (4.5.7), it is a geodesic 
satisfying y(0) = p € U, by Lemma 4.3.6, it satisfies y(1) = qg € U, by (4.5.8), it 
takes values in U by (4.5.6) and (4.5.7), and so y([0, 1]) C U, because the func- 
tion [0,1] > R: tb |(y(t))/’ is convex. 

We show that there exists at most one geodesic in U, joining p and q. 
Let p,g € U, and let y : [0,1] > U, be any geodesic such that y(0) = p and 
y(1) =q. Define v:= y(0) €7,M. Then y(t) = exp,(tv) for O<t <1 by 
Lemma 4.3.6. We claim that |v| < ¢. Suppose, by contradiction, that 


|u| > «. 
Then 


T:= ae 
|v| 


and, for 0 < t < T, we have |tv| < ¢ and exp, (tv) = y(t) € U, and so 
h(p, y(t)) = tv. 
by (4.5.8). Thus 
|h(p, y(t))| = tlv| for0<t <T. 

Take the limit t 7 T to obtain 

ln(p. v(T))| = Tv] =e 
in contradiction to (4.5.7). This contradiction shows that |v| < e. Since 

exp,(v) = y(1) = 4 € U, 
it follows from (4.5.8) that v = h(p,q). This proves Theorem 4.5.3. O 


Remark 4.5.6 Theorem 4.5.3 and its proof carry over to general sprays (see Defi- 
nition 4.3.2). 


Exercise 4.5.7. Consider the set U,(p) = {q ¢ M|d(p,q) <r} for pe M 
and r > 0. Corollary 4.5.4 asserts that this set is geodesically convex for r suf- 
ficiently small. How large can you choose r in the cases 


M =S?, M=T?=S'xsS'}, M =R’, M = R?’ \ {0}. 


Compare this with the injectivity radius. If the set U,(p) in these examples is 
geodesically convex, does it follow that every geodesic in U,(p) is minimizing? 
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4.6 Completeness and Hopf-Rinow 


For a Riemannian manifold there are different notions of completeness. First, 
in Sect. 3.4 completeness was defined in terms of the completeness of time depen- 
dent basic vector fields on the frame bundle (Definition 3.4.10). Second, there is a 
distance function 


d:MxM = (0,c) 


defined by equation (4.2.2) so that we can speak of completeness of the metric 
space (M, d) in the sense that every Cauchy sequence converges. Third, there is the 
question of whether geodesics through any point in any direction exist for all time; 
if so we call a Riemannian manifold geodesically complete. The remarkable fact 
is that these three rather different notions of completeness are actually equivalent 
and that, in the complete case, any two points in M can be joined by a shortest 
geodesic. This is the content of the Hopf—Rinow theorem. We will spell out the 
details of the proof for embedded manifolds and leave it to the reader (as a straight 
forward exercise) to extend the proof to the intrinsic setting. 


4.6.1 Geodesic Completeness 


Definition 4.6.1 (Geodesically complete manifold) Let M C R” be an m- 
dimensional manifold. Given a point p € M we say that M is geodesically com- 
plete at p iff, for every tangent vector v € 7, M, there exists a geodesic y : R > M 
(on the entire real axis) satisfying y(O) = p and y(0) =v (or equivalently 
V, = T,M where V, C T,M is defined by (4.3.5)). The manifold M is called 
geodesically complete iff it is geodesically complete at every point p € M. 


Definition 4.6.2 Let (/, d) be a metric space. A subset A C M is called bounded 
iff 


sup d(p, po) < © 
pea 


for some (and hence every) point po € M. 


Example 4.6.3. A manifold M@ C R” can be contained in a bounded subset of 
IR” and still not be bounded with respect to the metric (4.2.2). An example is the 
1-manifold M = {(x, y) €R*7|0<x<l,y= sin(1/x)}. 


Exercise 4.6.4 Let (M,d) be a metric space. Prove that every compact subset 
K C M is closed and bounded. Find an example of a metric space that contains a 
closed and bounded subset that is not compact. 
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Theorem 4.6.5 (Completeness) Let M C R” be a connected m-dimensional man- 
ifold and let d: M x M — (0, ov) be the distance function defined by (4.1.1), (4.2.1), 
and (4.2.2). Then the following are equivalent. 


(i) M is geodesically complete. 

(ii) There exists a point p € M such that M is geodesically complete at p. 

(iii) Every closed and bounded subset of M is compact. 

(iv) (M, d) is a complete metric space. 

(v) M is complete, i.e. for every smooth curve —&:R — R” and every ele- 
ment (po, eo) € F(M) there exists a smooth curve B : R > F(M) satisfying 


B(t) = Be(BO), BO) = (po, €0). (4.6.1) 


(vi) The basic vector field B; € Vect(F (M)) is complete for every § € R". 

(vii) For every smooth curve y’ : R + R", every po € M, and every orthogonal 
isomorphism ®y : T,,M — IR” there exists a development (®, y, y') of M 
along R” on all of R that satisfies y(O) = po and ®(0) = ®p. 


Proof The proof relies on Theorem 4.6.6 below. O 


4.6.2 Global Existence of Minimal Geodesics 


Theorem 4.6.6 (Hopf—Rinow) Let M C R" be a connected m-manifold and 
let p € M. Assume M is geodesically complete at p. Then, for every q € M, there 
exists a geodesic y : [0,1] — M such that 


y(0) = p, yQ) =4, L(y) = d(p,q). 


Before giving the proof of the Hopf—Rinow Theorem we show that it implies 
Theorem 4.6.5. 


Theorem 4.6.6 implies Theorem 4.6.5 That (i) implies (ii) follows directly from 
the definitions. 

We prove that (ii) implies (iii). Thus assume that M is geodesically com- 
plete at the point po € M and let K C M be a closed and bounded subset. Then 
r [= SUPycx 4(Po,q) < co. Hence Theorem 4.6.6 asserts that, for every q € K, 
there exists a vector v € T,,M such that |v| = d(po,q) < r and exp, (v) = q. Thus 
Kc exp,, (B,(Po)), B,(po) = {v € TM | |v| < r}. 


Then B := {v € T,,M | |v| <r, exp,,(v) € K} is a closed and bounded subset of 
the Euclidean space T,,,M. Hence B is compact and K = exp,,(B). Since the ex- 
ponential map exp,, : 7),MéM — M is continuous it follows that K is compact. This 
shows that (11) implies (111). 
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We prove that (iii) implies (iv). Thus assume that every closed and bounded 
subset of M is compact and choose a Cauchy sequence p; € M. Choose ip € N 
such that d(p;, p;) < 1 foralli, 7 ¢ N withi, j > io. Define 


c i= max d(pi, pi) + 1. 
SUSI 
Then d(pi, pi) < d(pi, Pip) + A(Pio. Pi) < A(P1, Pio) + 1 < ¢ for all i > ip and 
so d(pi, pi) < c for all i e N. Hence the set {p; |i ¢ N} is bounded and so is 
its closure. By (iii) this implies that the sequence p; has a convergent subsequence. 
Since p; is a Cauchy sequence, this implies that p; converges. Thus we have proved 
that (iii) implies (iv). 

We prove that (iv) implies (v). Fix a smooth curve € : R — R” and an ele- 
ment (po, eo) € F(M). Assume, by contradiction, that there exists a real num- 
ber T > 0 such that there exists a solution B : [0, 7) > F(M) of equation (4.6.1) 
that cannot be extended to the interval [0,7 + ¢) for any ¢ > 0. Write B(t) =: 
(y(t), e(t)) so that y and e satisfy the equations 


Yt) =e@EC), et) =AymYM)e), vO)= po, e(0) = eo. 


This implies e(t)n € Ty) M and é(t)n € TyyM for all 7 € R” and therefore 


d 
< (n.ett)'es) = 4, (en, eS) = (e@)n. e@)S) + (e@)n. es) = 0 


for all n,¢ € R” and all ¢ € [0, 7). Thus the function t +> e(t)'e(t) is constant, 
hence 


e(t)'e(t) = eke, _|je(t)|| = oe leOnl _ peal (4.6.2) 


neR™ | | 


for 0 <t < T, hence 
Iy(@)| = leMEO| < lleolllE@| < lleoll sup Jets) =!Cr 


and so d(y(s), y(t)) < L(y|js.q) < ( — s)er forO < s <t <T. Since (M,d) is 
a complete metric space, this shows that the limit p; := lim; z7 y(t) € M exists. 
Thus the set K := y([0,7)) U {pi} C M is compact and so is the set 


K := {(p,e) € F(M)|p € K, e'e = ejeo} C F(M). 


By equation (4.6.2) the curve [0,7) > R x F(M): tt (t, y(t), e(t)) takes val- 
ues in the compact set [0, 7] x K and is the integral curve of a vector field on the 
manifold R x #(M). Hence Corollary 2.4.15 asserts that [0, 7) cannot be the max- 
imal existence interval of this integral curve, a contradiction. This shows that (iv) 
implies (v). 
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That (v) implies (vi) follows by taking &(t) = & in (v). 

We prove that (vi) implies (i). Fix an element po € M and a tangent vector 
vo € TM. Let eo € Liso(R”, Tp, M) be any isomorphism and choose € € R” such 
that e9§ = vo. By (vi) the vector field By has a unique integral curve R > F(M): 
tH Bit) = (y(t), e(@) with 


B(O) = (po, €o). 


Thus 


Vtp=eME, (1) = hy (eMEe), 


and hence 


P(t) = EME = Ay MeEMHeOE = hyo (VO. VO). 


By the Gau8—Weingarten formula, this implies Vy(t) = O for every t and hence 
y:R— M is a geodesic with y(0) = po and (0) = eo& = vo. Thus M is 
geodesically complete and this shows that (vi) implies (1). 

The equivalence of (v) and (vii) was established in Corollary 3.5.25 and this 
shows that Theorem 4.6.6 implies Theorem 4.6.5. O 


4.6.3 Proof of the Hopf-Rinow Theorem 
The proof of Theorem 4.6.6 relies on the next two lemmas. 


Lemma 4.6.7 Let M C R" be a connected m-manifold and p € M. Suppose ¢ > 0 
is smaller than the injectivity radius of M at p and denote 


Z\(p) := {veET,M ||vj=1}, — S.(p) := {p' EM |d(p, p’) =}. 


Then the map X\(p) — S_(p) : v +> exp, (ev) is a diffeomorphism and, for all 
q € M, we have 


d(p.q) >€ — d(S-(p).q) = d(p.q) — €. 
Proof By Theorem 4.4.4, we have 

d(p, exp,(v)) = |v| for all v € T,M with |v| < 
and 


d(p, p')>€ for all p’ € M \ {exp,(v)|v € 7)M, |u| < e}. 
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This shows that S,(p) = exp, (eX (p)) and, since ¢ is smaller than the injectivity 
radius, the map 


Ei(p) > S.(p) : v > exp, (ev) 


is a diffeomorphism. 
To prove the second assertion, let g € M such that 


r:=d(p,q)>&. 
Fix a constant 6 > 0 and choose a smooth curve y : [0,1] — M such that 
yO=p, vyO=q, Liy)srts. 
Choose fo > 0 such that y (fo) is the last point of the curve on S,(p), ie. 


Y(to) € Se(p), y(t) € S.(p) fort <t <1. 
Then 


A 


d(y(to).4) < LY Ito.) 
L(y) — L(V |[o.t01) 
L(y) -«é 
<r+é-e. 


IA 


This shows that d(S.(p),q) <r +6 -—e for every 6 > 0 and therefore 
d(S-(p),q) <r —e. 
Moreover, 
d(p'.q) 2 d(p.q)—d(p, p')=r—e 
for all p’ € S,(p). Thus 
d(Sc(p),q) =r—€ 


and this proves Lemma 4.6.7. Oo 


Lemma 4.6.8 (Curve Shortening Lemma) Let M C R” be an m-manifold, 
let p € M, and let ¢ be a real number such that 


0<e<inj(p;M). 
Then, for allv,w € T,M, we have 


|v| = |w| =e, d(exp, (v), exp, (w)) = 2¢é => v+w=0. 
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Fig. 4.7 Two unit tangent 
vectors 


Proof We will prove that, for all v, w € 7,M, we have 


var d(exp, (Sv), exp, (6w)) 7 
50 6 


ju — wl. (4.6.3) 


Assume this holds and suppose, by contradiction, that there exist two tangent vec- 
tors v, w € T,M such that 


|v| = |w| = 1, d(exp, (ev), exp, (ew)) = 2¢, v+w #0. 
Then 
ju—w| <2 
(see Fig. 4.7). Thus by (4.6.3) there exists a constant 0 < 6 < e such that 
d(exp, (5v), exp, (dw)) < 26. 
Then 


d(exp, (ev), exp, (ew)) 
< d(exp, (ev), exp, (6v)) + d(exp, (6v), exp, (Sw)) + d(exp, (6w), exp, (ew)) 
<e—64+26+e-5=2e 


and this contradicts our assumption. 
It remains to prove (4.6.3). For this we observe that 


i d(exp, (6v), exp, (Sw)) 


50 6 

Sas d(exp, (5v), exp, (Sw)) |exp, (Sv) - exp, (6w)| 
80 lexp, (Sv) - exp, (dw)| ) 

= lexp, (Sv) - exp, (dw)| 
50 6 

ae exp, (5v) — p _ exp, (6w) — p 
50 6 6 

=|v-w)I. 


Here the second equality follows from Lemma 4.2.7. O 
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Proof of Theorem 4.6.6 By assumption M C R” is a connected submanifold, and 
p € M is given such that the exponential map exp, : T7,M — M is defined on the 
entire tangent space at p. Fix a point g € M \ {p}so that 

0<r:=d(p,q) <o. 


Choose a constant ¢ > 0 smaller than the injectivity radius of M at p and smaller 
than r. Then, by Lemma 4.6.7, we have 


d(S.(p).q) =r —€. 
Hence there exists a tangent vector v € 7,,M such that 
d(exp,(ev),q) =r —6, |v) = 1. 
Define the curve y : [0,r] > M by 
y(t) := exp, (tv) forO<t <r. 
By Lemma 4.3.6, this is a geodesic and it satisfies y(0) = p. We must prove 
that y(r) = q and L(y) = d(p,q). Instead we will prove the following stronger 
statement. 
Claim For every t € [0,r] we have 
d(y(t),q) =r-t. 
In particular, y(r) = q and L(y) =r = d(p,q). 
Consider the subset 
I := {t €[0,r] |dyv@,q) =r—-t} c [0,7]. 
This set is nonempty, because ¢ € J, it is obviously closed, and 
tel — [0,4] CI. (4.6.4) 
Namely, if t €¢ J andO < s <f, then 
d(y(s),q) < d(v(s), y@) +dv@,q) <t-str—-t=r-—s 
and 
d(y(s),4) = d(p.q) — d(p, y(s)) =r—s. 


Hence d(y(s),g) =r —s and hence s € J. This proves (4.6.4). 
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Fig. 4.8 The proof of the Se(¥(t)) exPpfEw) 


Hopf-—Rinow theorem 
G Ey ae 4 


We prove that J is open (in the relative topology of [0,7]). Let t € I be given 
with t < r. Choose a constant ¢ > 0 smaller than the injectivity radius of M at y(t) 
and smaller than r — ¢. Then, by Lemma 4.6.7 with p replaced by y(t), we have 


d(S(y¥@).q =r—-t—e. 
Next we choose w € T,)M such that 
|w| = 1, d(exp,(ew),q) =r—t—e. 
Then 


d(y(t — £), exp,qy(ew)) = d(y(t — €), 4g) — A(exp,q) (ew), g) 
=(r—-tt+e)—-(r—-t-—e) 
= 2e. 


The converse inequality is obvious, because both points have distance ¢ to y(t) (see 


Fig. 4.8). 
Thus we have proved that 


d(y(t — €), exp,q)(ew)) = 2e. 

Since 

y(t —£) = exp,qy(—eV (0), 
it follows from Lemma 4.6.8 that 

w= y(t). 

Hence exp,,)(sw) = y(t + 5) and this implies that 

d(y(t+¢),g)=r—-t-—e. 
Thus ¢t + e € J and, by (4.6.4), we have [0,¢ + ¢] € J. Thus we have proved that 


I is open. In other words, I is a nonempty subset of [0,7] which is both open and 
closed, and hence J = [0,7]. This proves the claim and Theorem 4.6.6. Oo 
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4.7 Geodesics in the Intrinsic Setting* 


This section examines the distance function on a Riemannian manifold, shows how 
the results of this chapter extend to the intrinsic setting, and discusses several ex- 
amples. 


4.7.1 Intrinsic Distance 


Let M be a connected smooth manifold (Sect. 2.8) equipped with a Riemannian 
metric (Sect. 3.7). Then we can define the length of a curve y : [0,1] ~ M by 
the formula (4.1.1) and it is invariant under reparametrization as in Remark 4.1.3. 
The distance function d : M x M — R is then given by the same formula (4.2.2). 
We prove that it still defines a metric on M and that this metric induces the same 
topology as the smooth structure. 


Lemma 4.7.1 Let M be a connected smooth Riemannian manifold and define the 
function d : M x M — [0, 00) by (4.1.1), (4.2.1), and (4.2.2). Then d is a metric 
and induces the same topology as the smooth structure. 


Proof The proof has three steps. 


Step 1 Fix a point py © M and let 6: U > 2 be a coordinate chart of M onto 
an open subset 2 CR” such that po € U. Then there exists an open neighborhood 
V CU of po and constants 6,r > 0 such that 


5|¢(p) — b(Po)| < Ap, Po) <= 8'1¢(P) — b(o)| (4.7.1) 
for every p € V and d(p, po) = 65r for every pE M\ V. 


Denote the inverse of the coordinate chart ¢ by y := @~!' : 2 > M and define the 
map § = (8ij)j'j=1 : & > R™ by 


oy 
ox! 


0 
sy) = (Soa), FG] 


Wr) 
for x € §2. Then a smooth curve y : [0,1] > U has the length 


1 
L(y) = / Ve(t)Tg(c()je(t)dt, — c(t) := (vy). (4.7.2) 
0 


Let xo := 6(po) € @ and choose r > 0 such that B,(xo) C 2. Then there is a 
constant 6 € (0, 1] such that 


BIEL < VETe(a)é < 5-16 (4.7.3) 
for all x € B,(xo) and &,n € R”. Define V := "!(B,(xo)) C U. 
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Now let p € V and denote x := ¢(p) € B,(xo). Then, for every smooth curve 
y : [0,1] ~ V with y(O) = po and y(1) = p, the curve 


cr goy 


takes values in B,(xo9) and satisfies c(0) = xo and c(1) = x. Hence, by (4.7.2) 
and (4.7.3), we have 


1 


1 
L(y) = 8 [ \e(lat >6 fewar = 6|x — Xo]. 
0 


0 


If y : [0, 1] ~ M is a smooth curve with endpoints y(0) = po and y(1) = p whose 
image is not entirely contained in V, then there exists a T € (0, 1] such that y(t) € V 
for 0 <t < T and y(T) € OV, so c(t) = d(y(t)) € B, (Xo) for 0 < t < T and 
|c(T) — xo| = r. Hence, by the above argument, we have 


L(y) = 6r. 


This shows that d(po, p) = 6r for p €¢ M \ V and d(po, p) = 6|¢(p) — }(po)| for 
pEeV.If pe V,x:= (p), and c(t) := x9 + t(x — Xo), then y := woc is asmooth 
curve in V with y(0) = po and y(1) = p and, by (4.7.2) and (4.7.3), 


1 
L(y) <8" fico dt =6"|x —xol. 
0 


This proves Step 1. 
Step 2 d is a distance function. 


Step | shows that d(p, po) > 0 for every p € M \ {po} and hence d satisfies condi- 
tion (i) in Lemma 4.2.3. The proofs of (ii) and (iii) remain unchanged in the intrinsic 
setting and this proves Step 2. 


Step 3 The topology on M induced by d agrees with the topology induced by the 
smooth structure. 


Assume first that W C M is open with respect to the manifold topology and let 
Po € W.Let@: U > 2 be acoordinate chart of M onto an open subset 22 C R” 
such that po € U, and choose V C U and 6,r as in Step 1. Then ¢(V N W) 
is an open subset of (2 containing the point #(po). Hence there exists a con- 
stant 0 < e < é6r such that Bs-1,(¢(po)) C 6(V N W). Thus by Step 1 we have 
d(p, po) = 6r => « for all p € M \ V. Hence, if p € M satisfies d(p, po) < €, 
then p € V, so |f(p) — 6(po)| < 57!d(p, po) < 5~1e by (4.7.1), and therefore 
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o(p) € OV OW). Thus B.(po:d) C W and this shows that W is open with 
respect to d. 

Conversely, assume that W C M be open with respect to d and choose a coordi- 
nate chart @: U — 92 onto an open set $2 C R”. We must prove that #(U NW) is an 
open subset of 92. To see this, choose x9 €¢(U NW) and let po:=@7!(x0) EeUNW. 
Now choose V C U and 6,r as in Step 1. Choose ¢ > 0 such that Bs-1,(p0; d) C W 
and B,(xo) C d(V). Let x € IR” such that |x — x9| <e. Then x € f(V) and therefore 
p:=@ '(x)€V. This implies d(p, po) <8~'|6(p) — (po)| = 5"! |x — xo] < 6-1, 
thus p ¢ WOU, andsox = ¢(p) € 6(W NU). Thus ¢(W NU) is open, and so 
W is open in the manifold topology of M. This proves Step 3 and Lemma 4.7.1. 0 


4.7.2. Geodesics and the Levi-Civita Connection 


With the covariant derivative understood (Theorem 3.7.8), we can define geodesics 
on M as smooth curves y : J > M that satisfy the equation Vy = 0, as in Defini- 
tion 4.1.5. Then all the above results about geodesics, as well as their proofs, carry 
over almost verbatim to the intrinsic setting. In particular, geodesics are in local 
coordinates described by equation (4.3.6) (Lemma 4.3.14) and they are the critical 
points of the energy functional 


1 
1 
EW) = 5 [word 
0 


on the space {2,,, of all paths y : [0,1] > M with fixed endpoints y(0) = p 
and y(1) = q. Here we use the fact that Lemma 4.1.7 extends to the intrinsic setting 
via the Embedding Theorem 2.9.12. So for every vector field X € Vect(y) along y 
with X(0) = 0 and X(1) = 0 there exists a curve of curves R > Qpq i St Ys 
with yo = y and 0sys|s=0 = X. Then, by the properties of the Levi-Civita connec- 
tion, we have 


dl 
1 
dE(y)X = 5 / aglaiy (OL at 
0 


(y(t), Vi X(t) dt 


II 
oN 


1 
== | (HO). xe) at. 
0 


The right hand side vanishes for all X if and only if Vy = 0 (Theorem 4.1.4). With 
this understood, we find that, for all p ¢ M and v € 7,M, there exists a unique 
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geodesic y : Jp, — M ona maximal open interval J, C R containing zero that 
satisfies y(0) = p and y(0) = v (Lemma 4.3.4). 
This gives rise to a smooth exponential map 


exp, : V) ={vé7T,M|1€I,,}>M 
as in Sect. 4.3 which satisfies 
d exp, (0) = id: T,M > T,M 


as in Corollary 4.3.7. This leads directly to the injectivity radius, the Gau8 
Lemma 4.4.5, the local length minimizing property of geodesics in Theorem 4.4.4, 
and the Convex Neighborhood Theorem 4.5.3. Also the proof of the equivalence 
of metric and geodesic completeness in Theorem 4.6.5 and of the Hopf—Rinow 
Theorem 4.6.6 carry over verbatim to the intrinsic setting of general Riemannian 
manifolds. The only place where some care must be taken is in the proof of the 
Curve Shortening Lemma 4.6.8 as is spelled out in Exercise 4.7.2 below. 


4.7.3 Examples and Exercises 


Exercise 4.7.2. Choose a coordinate chart ¢ : U > 92 with (po) = 0 such that 
the metric in local coordinates satisfies 


gi; (0) = 4;. 
Refine the estimate (4.7.1) in the proof of Lemma 4.7.1 and show that 


i d(p.q) 
im ———_ = 
pao |b(p) — $(q)| 


This is the intrinsic analogue of Lemma 4.2.8. Use this to prove that equation (4.6.3) 
continues to hold for all Riemannian manifolds, i.e. 


_ d(exp, (Sv), exp, (Sw)) _ 
50 6 


|u—w| 


for p € M and v,w ¢€ T,M. With this understood, the proof of the Curve Short- 
ening Lemma 4.6.8 carries over verbatim to the intrinsic setting. 


Exercise 4.7.3. The real projective space RP” inherits a Riemannian metric from 
S” as it is a quotient of S” by an isometric involution. Prove that each geodesic in 
S” with its standard metric descends to a geodesic in RP”. 
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Exercise 4.7.4 Let f : S* > S? be the Hopf fibration defined by 
f(z,w) = (21? — |w|?, 2Re Zw, 21m 2w) 


Prove that the image of a great circle in S? is a nonconstant geodesic in S$? if and 
only if it is orthogonal to the fibers of £, which are also great circles. Here we 
identify S? with the unit sphere in C*. (See also Exercise 2.5.22.) 
Exercise 4.7.5 Prove that a nonconstant geodesic y : R > $?"*! descends to a 
nonconstant geodesic in CP” with the Fubini-Study metric (see Example 3.7.5) if 
and only if y(t) L Cy(¢) for every ¢t € R. 
Exercise 4.7.6 Consider the manifold 

F(R") = {D eR” | D'D = 1} 
of orthonormal k-frames in R”, equipped with the Riemannian metric inherited 
from the standard inner product (X,Y) := trace(X'Y) on the space of real n x k- 
matrices. 


(a) Prove that 


Tp F(R") = {X €R"™ | D'X + X™D =O}, 
Tp Fx(R")* = {DA| A = ATE RF, 


and that the orthogonal projection J7(D) : R"’** — Tp F;(IR") is given by 
1 
I(D)X =X - 5D(Dix + X'D). 
(b) Prove that the second fundamental form of ¥;(IR”) is given by 
! T T 
hp(X)Y = —5D(X Y+Y'X) 


for D € F(R”) and X,Y € Tp F(R"). 
(c) Prove that a smooth map R > F;(IR") : tf bh D(f) is a geodesic if and only if 
it satisfies the differential equation 


D=-—DD'D. (4.7.4) 


Prove that the function D" D is constant for every geodesic in F; (R”). Compare 
this with Example 4.3.12. 
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Exercise 4.7.7. Let G;,(R”") = F;,(R”)/O(k) be the real Grassmannian of k- 
dimensional subspaces in IR”, equipped with a Riemannian metric as in Exam- 
ple 3.7.6. Prove that a geodesics R > F; (IR”):t t+ D(t) descends to a nonconstant 
geodesic in G; (R") if and only if D'D = 0 and D # 0. Deduce that the exponential 
map on G;(R”) is given by 


exp,(A) = im( D cos((B"D)") ae (BB) sin( (0"D)')) 


for A € F(R”) and AET, F,,(R") \ {0}. Here we identify the tangent space 
T,F;.(R") with the space of linear maps from A to At, and choose the matrices 
D € F(R") and D € R"** such that 


A=imD, DD =o, AoD=D:R* = At =kerD'. 


Prove that the group O(7) acts on G,(R”) by isometries. Which subgroup acts 
trivially? 


Exercise 4.7.8 Carry over Exercises 4.7.6 and 4.7.7 to the complex Grassmannian 
G;.(C”). Prove that the group U(7) acts on G;,(C”) by isometries. Which subgroup 
acts trivially? 


® 
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Curvature 


This chapter begins by introducing the notion of an isometry (Sect. 5.1). It shows 
that isometries of embedded manifolds preserve the lengths of curves and can be 
characterized as diffeomorphisms whose derivatives preserve the inner products. 
The chapter then moves on to the Riemann curvature tensor and establishes its sym- 
metry properties (Sect. 5.2). That section also includes a discussion of the covariant 
derivative of a global vector field. The next section is devoted to the generalized 
Gau8 Theorema Egregium which asserts that isometries preserve geodesics, the co- 
variant derivative, and the Riemann curvature tensor (Sect. 5.3). The final section 
examines the Riemann curvature tensor in local coordinates and shows how the 
definitions and results of the present chapter carry over to the intrinsic setting of 
Riemannian manifolds (Sect. 5.4). 


5.1 Isometries 
Let M and M' be connected submanifolds of R”. An isometry is an isomorphism of 
the intrinsic geometries of M and M’. Recall the definition of the intrinsic distance 
function 

d:MxM = (0,c) 


in Sect. 4.2 by 


1 
d(p.q)i= int LY. LO) = [eOlat 
0 


for p,q € M. Let d' denote the intrinisic distance function on M’. 
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Theorem 5.1.1 (Isometries) Let ¢: M — M' be a bijective map. Then the fol- 
lowing are equivalent. 


(i) @ intertwines the distance functions on M and M’, i.e. 


d'(¢(p).6(q)) = (p,q) 


forall p,qeM. 
Gi) @ is a diffeomorphism and 


do(p): T,M > Typ) M' 


is an orthogonal isomorphism for every p € M. 
(iii) @ is a diffeomorphism and 


Lipoy) = L(y) 
for every smooth curve y : [a,b] > M. 
The bijection ¢ is called an isometry iff it satisfies these equivalent conditions. In 
the case M = M’ the isometries ¢ : M — M forma group denoted by 1(M) and 
called the isometry group of M. 


The proof is based on the following lemma. 


Lemma 5.1.2 For every p € M there exists a constant ¢ > 0 such that, for 
allv,w € T,M with 0 < |w| < |v| < & we have 


d(exp, (w), exp, (v)) = |v| — |w| = w= —v. (5.1.1) 


Remark 5.1.3 It follows from the triangle inequality and Theorem 4.4.4 that 


d(exp,(v), exp,(w)) > d(exp,(v), p) — d(expp(w). p) 
= |v] = |) 


whenever 0 < |w| < |v| < inj(p). Lemma 5.1.2 asserts that equality can only hold 
when w is a positive multiple of v or, to put it differently, that the distance between 
exp,(v) and exp,(w) must be strictly bigger that |v| — |w| whenever w is not a 
positive multiple of v. 
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Proof of Lemma 5.1.2 As in Corollary 4.3.8 we denote 


B.(p):= {v € T,M ||v| < eh, 
U-(p) = tg € M | d(p.q) < &}. 


By Theorem 4.4.4 and the definition of the injectivity radius, the exponential map at 
p is a diffeomorphism exp, : B.(p) — U,(p) for € < inj(p). Choose 0 < r < inj(p). 
Then the closure of U,(p) is acompact subset of M. Hence there is a constant e > 0 
such that e < r and e < inj(p’) for every p’ € U,(p). Since e < r we have 


€ < inj(p’) Vp’ €U,(p). (5.1.2) 
Thus exp, : B.(p’) > U,(p’) is a diffeomorphism for every p’ € U,(p). Define 
Pi ‘= exp,(w) and pz := exp,(v). Then, by assumption, we have d(pj, p2) = 


|v| — |w| < e. Since p; € U,(p) it follows from our choice of ¢ that ¢ < inj(p1). 
Hence there is a unique tangent vector v; € 7,,M such that 


lui] = d(pi, p2) = |v|—|wl], exp, (v1) = pr. 


Following first the shortest geodesic from p to p; and then the shortest geodesic 
from p; to p2 we obtain (after suitable reparametrization) a smooth y : [0,2] ~ M 
such that 


y(0) = p, yQ) = pi, y(2) = po, 


and 


L(y |.) = 4(p, pi) = |v, L(y|p.q) = €(p1, p2) = |v| — |wI. 


Thus L(y) = |v| = d(p, pz). Hence, by Theorem 4.4.4, there is a smooth function 
B : [0,2] — [0, 1] satisfying 


BO)=0, B2=1, Bt)20, — y(t) =exp,(B(t)v) 
for every t € [0, 2]. This implies 
exp,(w) = pi = y(1) = exp, (B(1)v), 0< fC) <1. 
Since w and A(1)v are both elements of B,(p) and exp, is injective on B,(p), this 


implies w = B(1)v. Since B(1) => 0 we have B(1) = |w|/|v|. This proves (5.1.1) 
and Lemma 5.1.2. Oo 
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Proof of Theorem 5.1.1 That (ii) implies (iii) follows from the definition of the 
length of a curve. Namely 


b 
d 
Libor) = [| Lo]ar 


b 
7 , Ido (yt) p(t) dt 


In the third equation we have used (ii). That (iii) implies (i) follows immediately 
from the definition of the intrinsic distance functions d and d’. 

We prove that (i) implies (ii). Fix a point p € M and choose € > 0 so small that 
€ < min{inj(p; M), inj(¢(p); MD} and that the assertion of Lemma 5.1.2 holds for 
the point p’ := o(p) € M’. Then there is a unique homeomorphism 


®, : Be(p) > Be(o(p)) 


such that the following diagram commutes. 


® 
T,M > B.(p) —> B-(¢(p)) C Tp(p)M’ . 
eXPp EXP), p) 
g 


Here the vertical maps are diffeomorphisms and ¢ : U,(p) > U;(@(p)) is a home- 
omorphism by (i). Hence ®, : B,(p) > B-(@(p)) is a homeomorphism. 


Claim 1 The map ®, satisfies the equations 


EXP») (Pp(v)) = H(exp,(v)), (5.1.3) 
|P,()| = leh, (5.1.4) 
®, (tv) = t®,(v) 6.15) 


for every v € B,(p) and every t € [0, 1]. 
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Equation (5.1.3) holds by definition. To prove (5.1.4) we observe that, by Theo- 
rem 4.4.4, we have 


|2,(v)| = d'($(P). EXPlp¢p) (Pp (v))) 
= d'(¢(p), 6(exp, (v))) 
= d(p,exp,(v)) 


= |v]. 


Here the second equation follows from (5.1.3) and the third equation from (i). Equa- 
tion (5.1.5) holds for t = 0 because , (0) = 0 and for ¢ = 1 it is a tautology. Hence 
assume 0 < ¢ < 1. Then 


d' (exP\y¢)(Pp(t0)), xPiy¢,(Pp(v))) = a'(#(exp, (tv)), (exp, (v))) 
= d(exp, (tv), exp, (v)) 
= |v| — |r 
= |P,(v)| ~ |2,(¢v)]. 


Here the first equation follows from (5.1.3), the second equation from (i), the third 
equation from Theorem 4.4.4 and the fact that |v| < inj(p), and the last equation 
follows from (5.1.4). Since 0 < |S, (tv)| < |,(v)| < e we can apply Lemma 5.1.2 
and obtain 


|P,(¢v)| 


P= V9, 


®,(v) = tp (v). 


This proves Claim 1. 
By Claim 1, @, extends to a bijective map ©, : T,M — Ty(p)M' via 


1 
®,(v) := 3 Pru), 


where 5 > 0 is chosen so small that 5|v| < e. The right hand side of this equation 
is independent of the choice of 5. Hence the extension is well defined. It is bijective 
because the original map ©, is a bijection from B,(p) to B,(@(p)). The reader 
may verify that the extended map satisfies the conditions (5.1.4) and (5.1.5) for all 
v €T,M andallt = 0. 
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Claim 2 The extended map ®, :T,M — Ty,p)M' is linear and preserves the inner 
product. 


It follows from the equation (4.6.3) in the proof of Lemma 4.6.8 that 


ee sie exp, (¢w)) 


a 


|v — w| = 


at (exp, (tv). $ (exp, (tw))) 

_ d' (EXPi5(p)(Pp v)), EXP%5(p) (Pp (tw))) 
- d exp (t®p(0)) EXP4(p) CPp(w))) 
= \2,(0) — d,(w)|. | 


Here the second equation follows from (i), the third from (5.1.3), the fourth 
from (5.1.4), and the last equation follows again from (4.6.3). By polarization 
we obtain 


2(v, w) = |v? + |wl? — |v — w/? 


= |,(v)|" + |G, (w)|” — |S, (v) — & (w)|? 
= 2(®,(v), Pp(w)). 


Thus ®, preserves the inner product. Hence, for all vj, v2, w € TM, we have 


vi + U2, w) 

U1, W) + (v2, w) 

® 0; P,(w)) + (Pp (v2), Pp(w)) 
P,(V1) + Pp(v2), Pp(w)). 


(D,(v1 + v2), Pp(w)) = 


( 
| 
= 
= 


Since @, is surjective, this implies 
P,(v, + v2) = ®y(v1) + Bp(v2) 
for all v}, v2 € T,M. With vj = v and v2 = —v we obtain 
,(—v) = —®,(v) 
for every v € T,M and by (5.1.5) this gives 
®, (tv) = t®,(v) 


for all v € T,M andt € R. This proves Claim 2. 
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Claim 3 ¢ is smooth and do(p) = ®y. 
By (5.1.3) we have 


b = exp») 0 By oexp,| : Ue(p) > U-(G(p)). 


Since ®, is linear, this shows that the restriction of ¢ to the open set U.(p) is 
smooth. Moreover, for every v € 7, M we have 


d d 
do(pv = =| blexpp(tv)) = =] expyi)(¢®p(v)) = Pp(v). 
t=0 t=0 


Here we have used equations (5.1.3) and (5.1.5) as well as Lemma 4.3.6. This 
proves Claim 3 and Theorem 5.1.1. Oo 


Exercise 5.1.4 Prove that every isometry yw : R” — R” is an affine map 


W(p) = Ap +b 
where A € O(n) and b € R". Thus y is a composition of translation and rota- 
tion. Hint: Let e),..., é, be the standard basis of R”. Prove that any two vectors 
v,w € R” that satisfy 
|v] = |w 
and 
|v —e;| = |w —e;| fori =1,...,” 
must be equal. 
Remark 5.1.5 If yy: R” — R” is an isometry of the ambient Euclidean space with 
w(M) = M’, then certainly ¢ := wy is an isometry from M onto M’. On the 
other hand, if M is a plane manifold 
M ={(0,y,z) € R°|0< y < 2/2} 
and M’ is the cylindrical manifold 
M' = {(x,y,z) € R?|x?+ y? =1, x >0, y > 0}, 


Then the map ¢ : M — M’ defined by 


(0, y.z) := (cos(y), sin(y), 2) 
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is an isometry which is not of the form ¢ = W|. Indeed, an isometry of the form 
go = w\|m necessarily preserves the second fundamental form (as well as the first) 
in the sense that 


dy (p)hp(v, w) = hyp (dW (p)v, dy (p)w) 
for v, w € T,M but in the example h vanishes identically while h’ does not. 
We may thus distinguish two fundamental question: 


I. Given M and M’ when are they extrinsically isomorphic, i.e. when is there an 
ambient isometry yw : R” > R” with ¥(M) = M’? 

II. Given M and M’ when are they intrinsically isomorphic, i.e. when is there an 
isometry ¢ : M — M’ from M onto M’? 


As we have noted, both the first and second fundamental forms are preserved by 
extrinsic isomorphisms while only the first fundamental form need be preserved by 
an intrinsic isomorphism (i.e. an isometry). 

A question which occurred to Gau8 (who worked for a while as a cartographer) 
is this: Can one draw a perfectly accurate map of a portion of the earth? (i.e. a map 
for which the distance between points on the map is proportional to the distance 
between the corresponding points on the surface of the earth). We can now pose 
this question as follows: Is there an isometry from an open subset of a sphere to an 
open subset of a plane? Gaui answered this question negatively by associating an 
invariant, the GauBian curvature K : M — R, toa surface M C R?. According to 
his Theorema Egregium 


K'of=K 
for an isometry ¢ : M — M". The sphere has positive curvature; the plane has zero 
curvature; hence the perfectly accurate map does not exist. Our aim is to explain 


these ideas. 


Local Isometries 
We shall need a concept slightly more general than that of “isometry”. 


Definition 5.1.6 (Local isometry) A smooth map ¢: M — M’ is called a local 
isometry iff its derivative 


dp(p) : T,M — To p)M' 


is an orthogonal linear isomorphism for every p € M. 
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Remark 5.1.7 Let M C R” and M’ C R” be manifolds and ¢: M > M' bea 
map. The following are equivalent. 


(i) ¢ isa local isometry. 
(ii) For every p € M there are open neighborhoods U C M and U' Cc M’ such 
that the restriction of ¢ to U is an isometry from U onto U’. 


That (ii) implies (i) follows immediately from Theorem 5.1.1. On the other hand (i) 
implies that d¢(p) is invertible so that (ii) follows from the inverse function theo- 
rem. 


Example 5.1.8 The map 
R—> S!': 6H e? 
is a local isometry but not an isometry. 


Exercise 5.1.9 Let M Cc R” be a compact connected |-manifold. Prove that M 
is diffeomorphic to the circle S!. Define the length of a compact connected Rie- 
mannian |-manifold. Prove that two compact connected 1-manifolds M, M’' Cc R" 
are isometric if and only if they have the same length. Hint: Let y : R > M be 
a geodesic with |y(t)| = 1. Show that y is not injective; otherwise construct an 
open cover of M without finite subcover. If fo < t; with y(to) = y(t,), show that 
Y(to) = y(t); otherwise show that y (ftp + ¢) = y(t, — 1) for all ¢ and find a contra- 
diction. 


The next result asserts that two local isometries that have the same value and the 
same derivative at a single point must agree everywhere, provided that the domain 
is connected. 


Lemma 5.1.10 Let M CR" and M' C R” be smooth m-manifolds and assume 
that M is connected. Let ¢: M — M' andw:M — M' be local isometries and 
let po € M such that 


(Po) = V(Po) =: Po: db(po) = AW (Po) : Ty) M > TM’. 
Then }(p) = W(p) for every p € M. 
Proof Define the set 


Mo := {p © M|$(p) = W(p), do(p) = dy (p)}. 


This set is obviously closed. We prove that Mp is open. Let p € Mo and choose 
U CM and U’ C Mas in Remark 5.1.7 (ii). Denote 


®, := do(p) = dy(p):1T,M > Ty M', D' := o(p) = V(p) 
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Then it follows from equation (5.1.3) in the proof of Theorem 5.1.1 that there exists 
a constant ¢ > 0 such that U,(p) C U and U,(p’) C U’ and 


q € U-(p) = $(q) = exp), 0 ®, o exp,'(q) = W(q). 


Hence U,(p) C Mo. Thus Mp is open, closed, and nonempty. Since M is connected 
it follows that My = M and this proves Lemma 5.1.10. O 


Exercise 5.1.11 


(i) Ifa sequence of local isometries $; : M — M' converges uniformly to a local 
isometry @: M — M', then it converges in the C™ topology. Hint: Let p « M. 
Then every sufficiently small tangent vector v € T,,M satisfies the equation 
d¢(p)v = (exp,,)) '((exp, (v))). Use this to prove that dd; (p) converges 
to do(p). Deduce that ¢; converges to ¢ uniformly with all derviatives in a 
neighborhood of p. 

(ii) The C™ topology on the space of local isometries from M to M' agrees with 
the C° topology. 


5.2. The Riemann Curvature Tensor 


This section defines the Riemann curvature tensor and proves the Gau®—Codazzi 
formula (Sect. 5.2.1), introduces the covariant derivative of a global vector field 
(Sect. 5.2.2), expresses the curvature tensor in terms of a global formula 
(Sect. 5.2.3), establishes its symmetry properties (Sect. 5.2.4), and examines the 
curvature for a class of Riemannian metrics on Lie groups (Sect. 5.2.5). 


5.2.1 Definition and Gau8-Codazzi 


Let M C R" be a smooth manifold and y : R* — M be a smooth map. Denote 
by (s,¢) the coordinates on R?. Let Z € Vect(y) be a smooth vector field along y, 
i.e. Z : R? + R" is a smooth map such that Z(s,t) € Ts) M for all s and ¢. The 
covariant partial derivatives of Z with respect to the variables s and ¢ are defined 
by 


OZ dZ 
V,Z := T(y)—. V,Z := IT(y)—_. 
as ot 


In particular 0,y = dy/ds and 0,y = dy/dt are vector fields along y and we have 
V;0:y — V;0sy = 0 as both terms on the left are equal to J7(y)0,0,y. Thus ordinary 
partial differentiation and covariant partial differentiation commute. The analogous 
formula (which results on replacing 0 by V and y by Z) is in general false. Instead 
we have the following. 
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Definition 5.2.1. The Riemann curvature tensor assigns to each p € M the bi- 
linear map R, : 7,M x T,M — L£(T,M,T,M) characterized by the equation 


R,(u, v)w = (VV: Z — V:VsZ)(0, 0) (5.2.1) 


foru,v,w<€T,M where y: IR? > M isasmooth map and Z € Vect(y) is a smooth 
vector field along y such that 


y(0,0) = p, dsy(0,0) = u, 0;y(0,0) = v, Z(0,0) =w. (5.2.2) 


We must prove that R is well defined, i.e. that the right hand side of equa- 
tion (5.2.1) is independent of the choice of y and Z. This follows from the Gau}— 
Codazzi formula which we prove next. Recall that the second fundamental form 
can be viewed as a linear map hy :T,M > L(T,M, T,M*) and that, foru € T,M, 
the linear map h,(u) € £(7,M, T,M*) and its dual h,(u)* € L£(T,M+, T,M) are 
given by 


h,(u)v = (dIT(p)u)v, hy(u)*w = (dIT(p)u)w 
forv € 7,M andw € TM. 


Theorem 5.2.2. The Riemann curvature tensor is well defined and given by the 
Gauf—Codazzi formula 


R,(u, v) = hp(u)*hp(v) — hp(v)*hp(u) (5.2.3) 
foru,v €T,M. 
Proof Let u,v,w € T,M and choose a smooth map y : IR? — M anda smooth 
vector field Z along y such that (5.2.2) holds. Then, by the GauB—Weingarten for- 


mula (3.2.2), we have 


V,Z =0,Z —hy(0;y)Z 
= 0,Z — (dII(y)0,y)Z 
= 0,Z — (0,(IT 0 y))Z. 


Hence 
IVZ = 9,0,2 = a,((3,(7 ° y))Z) 
= 0,0,Z — (0,9,(IT o y))Z — (0,(IT° y))d,Z 
= 0,0;Z — (0,0; (IT 0 y))Z — (dIT(y)d,v)(VsZ + hy (dsv)Z) 
= 0,0;Z — (0,8;(H 0 y))Z — hy (0,y)VsZ — hy (0,y)*hy Osy)Z. 
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Interchanging s and f and taking the difference we obtain 
AsV: Z —O;VsZ = hy (Osy)*hy (Ory) Z — hy (Ory) *hy Osy)Z 
+ h,(0sy)ViZ — hy (0:y)Vs5Z. 


Here the first two terms on the right are tangent to M and the last two terms on the 
right are orthogonal to 7; M. Hence 


VsViZ = ViVsZ = IT(y)(0sViZ —_ 0,VsZ) 
= hy (Osy)*hy (Oy) Z me hy (0;y)"hy(Osy)Z. 


Evaluating the right hand side at s = tf = 0 we find that 
(V;V;iZ — V,V;Z)(0, 0) = h,(u)*hp(v)w — hp (v)*hp(u)w. 


This proves the Gauf—Codazzi equation and shows that the left hand side is inde- 
pendent of the choice of y and Z. This proves Theorem 5.2.2. Oo 


5.2.2. Covariant Derivative of a Global Vector Field 


So far we have only defined the covariant derivatives of vector fields along curves. 
The same method can be applied to global vector fields. This leads to the following 
definition. 


Definition 5.2.3 (Covariant derivative) Let MM C R” be an m-dimensional sub- 
manifold and X be a vector field on M. Fix a point p € M and a tangent vector 
v € T,M. The covariant derivative of X at p in the direction v is the tangent 
vector 


V »X(p) := I(p)dX(p)v € T,M, 


where I7(p) € R"*" denotes the orthogonal projection onto 7,M. 


Remark 5.2.4 Let y : J — M bea smooth curve on an interval J C R and let 
X € Vect(M) be a smooth vector field on M. Then X o y is a smooth vector field 
along y and the covariant derivative of X o y is related to the covariant derivative 
of X by the formula 


V(X 0 y)(t) = Vin X(V(0)). (5.2.4) 


Remark 5.2.5 (Gau$}—Weingarten formula) Differentiating the equation X = 
ITX (understood as a function from M to R”) and using the notation 0, X(p) := 
dX(p)v for the derivative of X at p in the direction v we obtain the GauB- 
Weingarten formula for global vector fields: 


OyX(p) = VoX(p) + Ap(v)X(p). (32.9) 
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Remark 5.2.6 (Levi-Civita connection) Differentiating a vector field Y on M 
covariantly in the direction of another vector field X we obtain a vector field Vy Y € 
Vect(M) defined by 


(Vy Y)(p) := Vxcp) Y(p) 
for p € M. This gives rise to a family of linear operators 
Vx : Vect() — Vect(M), 
one for each vector field X¥ € Vect(M), and the assignment 
Vect(M) + L£(Vect(M), Vect(M)) : X +> Vy (5.2.6) 


is itself a linear operator. This linear operator is called the Levi-Civita connection 
on the tangent bundle 7M. 
The Levi-Civita connection (5.2.6) satisfies the conditions 


Vex (Y) = f Vx, (5.2.7) 
Vx(fY) = fVx¥ + (Lx f)Y, (5.2.8) 
fel, Z) = (Wry, Z) + 1, VeZ), (5.2.9) 

VyX —VxY = [X,Y] (5.2.10) 


for all X,Y,Z © Vect(M) and all f € F(M), where Ly f = df o X and 
[X,Y] € Vect(M) denotes the Lie bracket of the vector fields X and Y. The 
conditions (5.2.7) and (5.2.8) assert that the linear operator (5.2.6) is a connection 
on the tangent bundle 7M, condition (5.2.9) asserts that the connection (5.2.6) 
is Riemannian (i.e. it is compatible with the first fundamental form), and condi- 
tion (5.2.10) asserts that it is torsion-free. 


The next lemma shows that the Levi-Civita connection (5.2.6) is uniquely de- 
termined by (5.2.9) and (5.2.10), and hence is the unique torsion-free Riemannian 
connection on the tangent bundle TM. 


Lemma 5.2.7 (Uniqueness Lemma) There is a unique linear operator 
Vect(M) — L£(Vect(M), Vect(M)) : X & Vy 


satisfying equations (5.2.9) and (5.2.10) for all X,Y, Z € Vect(M). 


Proof Existence follows from the properties of the Levi-Civita connection. 
We prove uniqueness. Let X +» Dy be any linear operator from Vect(M) to 
L£(Vect(M), Vect(M/)) that satisfies (5.2.9) and (5.2.10). Then we have 
Ly (X, Z) = (DyX. o: + (X, DyZ), 
—L7(X,Y) = —(DzX,Y) —(X, DzY). 
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Adding these three equations we find 

Ly (Y,Z) + Ly (Z, X) —Lz(X,Y) 

= 2(DyY, Z) + (DyX — DyY, Z) 

+(x, DyZ.=— Dz¥) + (¥, DgZ = DzX) 

= 2(DxY, Z) + ([X, Y], Z) + (X,[Z, Y]) + (¥,[Z, X]). 

The same equation holds for the Levi-Civita connection and hence 
(DyY, Z) = (VyY,Z). 

This implies Dy Y = VyY forall X,Y € Vect(M). Oo 


Exercise 5.2.8 In the proof of Lemma 5.2.7 we did not actually use the assumption 
that the operator Dy : Vect(/Z) — Vect(M) is linear nor that the operator X th Dy 
is linear. Prove directly that if a map 


Dy : £(M) > £(M) 


satisfies (5.2.9) for all Y, Z € Vect(M), then Dy is linear. Prove that every map 
Vect(M) — L£(Vect(M), Vect()) : X +» Dy that satisfies (5.2.10) is linear. 


Exercise 5.2.9 Let ¢' be the flow of a complete vector field X¥ on M and let y' 
be the flow of a complete vector field Y on M. 


(i) Prove that the formula X(p, v) := (X(p), dX(p)v) defines a vector field on 
the tangent bundle 7M. Hint: Lemma 4.3.1 and equation (5.2.5). 

(ii) Prove that the flow of X is given by @ (p, v) := (#'(p), do" (p)v). 

(iii) Prove that the vector fields t~!((w‘)*X — X) converge to [X,Y] in the C! 
topology as f tends to zero. Hint: Establish C° convergence in Lemma 2.4.18 
and then use this result for the vector fields X and Y. 


Remark 5.2.10 (The Levi-Civita connection in local coordinates) Let ¢:U — 
92 be a coordinate chart on an open set U C M with values in an open set 22 C R”. 
In such a coordinate chart a vector field X € Vect(M) is represented by a smooth 
map & = (&!,...,&”) : 2 — R” defined by 


§($(p)) = db(p)X(p) 
for p € U.If Y € Vect(M) is represented by 7, then Vy Y is represented by the map 


m ané : m ; : 
Vi = - Dien’. 5.2.11 
(Ven) 2 378 +2, yen (5.2.11) 
Here the ry : Q — R are the Christoffel symbols defined by 
; m 1 Oki O20; Ogi; 
C= ae f_ |, 5.2.12 
o 2 ae ( ox! = ax! -Ox® ( 


t=] 
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where gj; is the metric tensor and g’/ is the inverse matrix so that 
ik k 
‘> gig’ = 4; 
J 


(see Lemma 3.6.5). This formula can be used to prove the existence statement in 
Lemma 5.2.7 and hence define the Levi-Civita connection in the intrinsic setting. 
5.2.3 A Global Formula 
Lemma 5.2.11 For X,Y, Z € Vect(M) we have 

R(X, Y)Z =VxVyZ—VyVyZ + VixyZ. (5.2.13) 
Proof Fix a point p € M. Then the right hand side of equation (5.2.13) at p 
remains unchanged if we multiply each of the vector fields X, Y, Z by a smooth 
function f : M — [0,1] that is equal to one near p. Choosing f with compact 


support we may therefore assume that the vector fields X and Y are complete. Let 
¢* denote the flow of X and yw’ the flow of Y. Define the map y : R? > M by 


y(s,t):=¢% ow'(p), st ER. 
Then 
Isy=X(y), Or = GLY)(Y). 
Hence, by Remark 5.2.4 we have 
Vs(Z ov) =(VxZ)(y), = -Vi(Z oy) = (Vosy Z)(y). 
Using Remark 5.2.4 again we obtain 


V;Vi(Z ov) = Vay(Vosy Z)(v) + (Va,esyZ)(y), 
V,Vs(Z oy) = (Vosy VxZ)(y). 


Since 
0 
ac| GY = [X,Y] 
OS |, 


and d;y = X(y), it follows that 


VsVi(Z 0 y)(0,0) = (Vx Vv Z + Vix.7}Z)(p). 
V,Vs(Z 0 y)(0,0) = (Vy Vx Z)(p). 
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Hence 


R,(X(p), Y(p))Z(p) = (VsVi(Z ov) — ViVs(Z © v)) (0, 0) 
= (Vy Vy Z — Vy VyZ + Vix.xjZ)(p). 


This proves Lemma 5.2.11. O 


Remark 5.2.12 Equation (5.2.13) can be written succinctly as 
[Vy, Vy] + Vixy) = R(X, Y). (5.2.14) 
This can be contrasted with the equation 
[Ly, Ly] + Lyx] =0 (5.2.15) 


for the operator Ly on the space of real valued functions on M. 


Remark 5.2.13 Equation (5.2.13) can be used to define the Riemann curvature 
tensor. To do this one must again prove that the right hand side of equation (5.2.13) 
at p depends only on the values X(p), Y(p), Z(p) of the vector fields X, Y, Z at 
the point p. For this it suffices to prove that the map 


Vect(M) x Vect(M) x Vect(M) — Vect(M): (X,Y, Z) B R(X, Y)Z 
is linear over the Ring F(M) of smooth real valued functions on M, ie. 
R(fX, Y)Z = R(X, fY)Z = R(X, Y)fZ = fR(X,Y)Z (5.2.16) 


for X,Y,Z € Vect(M) and f € F(M). The formula (5.2.16) follows from the 
equations (5.2.7), (5.2.8), (5.2.15), and [X, fY] = f[X, Y] — (Ly f)Y. It follows 
from (5.2.16) that the right hand side of (5.2.13) at p depends only on the vec- 
tors X(p), Y(p), Z(p). The proof requires two steps. One first shows that if X 
vanishes near p, then the right hand side of (5.2.13) vanishes at p (and simi- 
larly for Y and Z). Just multiply X by a smooth function equal to zero at p 
and equal to one on the support of X; then fX = X and hence the vector field 
R(X, Y)Z = R(fX,Y)Z = fR(X, Y)Z vanishes at p. Second, we choose a local 
frame E1,..., Em € Vect(M), i.e. vector fields that form a basis of 7, M for each 
p in some open set U C M. Then we may write 


x= #8, Y= SWE; Z=) OE 
i=1 j 


in U. Using the first step and the ¥(M/)-multilinearity we obtain 
R(X,Y)Z = o> &' ni CK RUE;, Ej) Ex 
ipk=1 


in U. If X'(p) = X(p), then &'(p) = &"(p) so if X(p) = X'(p), Y(p) = Y'(p), 
Z(p) = Z'(p), then (R(X, Y)Z)(p) = (R(X’, Y’)Z')(:p) as required. 
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5.2.4 Symmetries 


Theorem 5.2.14 The Riemann curvature tensor satisfies 


R(Y, X) = —R(X,Y) = R(X,Y)*, (5.2.17) 
R(X, Y)Z + R(Y,Z)X + R(Z, X)Y =0, (5.2.18) 
(R(X, Y)Z, W) = (R(Z, W)X,Y) (5.2.19) 


for X,Y,Z,W € Vect(M). Equation (5.2.18) is the first Bianchi identity. 


Proof The first equation in (5.2.17) is obvious from the definition and the second 
follows from the Gau8—Codazzi formula (5.2.3). Alternatively, choose a smooth 
map y : R? — M and two vector fields Z, W along y. Then 


0 = 0;0;(Z, W) — 0,05(Z, W) 
= 0,(V,Z,W) +.05(Z, V,W) —0,(Vs5Z, W) — 0:(Z, VsW) 
= (VsV;Z,W) + (Z,VsViW) —(ViV5Z,W) — (Z,ViVsW) 
= (R(0sy,0;y)Z,W) + (Z, ROsy, 0:y)W). 


This proof has the advantage that it carries over to the intrinsic setting. We prove 
the first Bianchi identity using (5.2.10) and (5.2.13): 


R(X,Y)Z + R(Y,Z)X + R(Z, X)Y 

= Ve Vyl = Vr VeZ + VieenZ + VeV2k = VeVeXk $V 
+ VzVxY¥ —VxVzY + Viz.x\¥ 

= Viy.z)X — Vx[¥, Z] + Viz.xy¥ — Vy[Z,X] + VivyjZ — Vz[X. Y] 

= [X,[Y, Z]] + [Y, |Z, X]] + [Z, |X, Y]]. 


The last term vanishes by the Jacobi identity. We prove (5.2.19) by combining the 
first Bianchi identity with (5.2.17): 


(R(X, Y)Z, W) — (R(Z, W)X, Y) 

= —(R(Y,Z)X,W) —(R(Z, X)Y, W) — (R(Z, W) X,Y) 
= (R(Y, Z)W, X) + (R(Z, X)W,Y) + (R(W, Z)X,Y) 
= (R(Y, Z)W, X) — (R(X, W)Z.Y) 

= (R(Y, Z)W, X) — (R(W, X)Y, Z). 


Note that the first line is related to the last by a cyclic permutation. Repeating this 
argument we find 


(R(Y, Z)W, X) — (R(W, X)Y, Z) = (R(Z, W) X,Y) — (R(X, Y)Z,W). 


Combining the last two identities we obtain (5.2.19). This proves Theorem 5.2.14. 
Oo 
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Remark 5.2.15 We may think of a vector field X on M as a section of the tangent 
bundle. This is reflected in the alternative notation 


92°(M, TM) := Vect(M). 


A 1-form on M with values in the tangent bundle is a collection of linear maps 
A(p): T,M — T,M, one for every p € M, which is smooth in the sense that for 
every smooth vector field X on M the assignment p +» A(p)X(p) defines again a 
smooth vector field on M. We denote by 


2'(M,TM) 


the space of smooth 1-forms on M with values in 7M. The covariant derivative of 
a vector field Y is such a 1-form with values in the tangent bundle which assigns to 
every p € M the linear map 7, M — T,M :v+> V,Y(p). Thus we can think of 
the covariant derivative as a linear operator 


V :2°(M,TM) > 2'(M,TM). 


The equation (5.2.7) asserts that the operators X +» Vy indeed determine a linear 
operator from 2°(M, TM) to 2!(M, TM). Equation (5.2.8) asserts that this linear 
operator V is a connection on the tangent bundle of M. Equation (5.2.9) asserts that 
V is a Riemannian connection and equation (5.2.10) asserts that V is torsion-free. 
Thus Lemma 5.2.7 can be restated as asserting that the Levi-Civita connection is 
the unique torsion-free Riemannian connection on the tangent bundle. 


Exercise 5.2.16 Extend the notion of a connection to a general vector bundle E, 
both as a collection of linear operators Vy : 2°(M, E) > 2°(M, E), one for every 
vector field X € Vect(M), and as a linear operator 


V : 2°(M, E) > 2'(M, E) 


satisfying the analogue of equation (5.2.8). Interpret this equation as a Leibniz rule 
for the product of a function on M with a section of E. Show that V+ is a connection 
on 7M +. Extend the notion of curvature to connections on general vector bundles. 


Exercise 5.2.17 Show that the field which assigns to each p € M the multi-linear 
map R :T,M x T,M > LM, tM) characterized by 


R*(Asy, 0:y)Y¥Y = ViViY —ViViY 
for y : R? > M and € Vect+(y) satisfies the equation 
Ry (u,v) = hp(u)hp(v)* — hy (v)hp (u)* 


for p € M andu,v €7,M. 


5.2. The Riemann Curvature Tensor 235 
5.2.5 Riemannian Metrics on Lie Groups 


We begin with a calculation of the Riemann curvature tensor on a Lie subgroup of 
the orthogonal group O(7) with the Riemannian metric inherited from the standard 
inner product 


(v,w) := trace(v'w) (5.2.20) 


on the ambient space gl(n,R) = R”™”. This fits into the extrinsic setting used 
throughout most of this book. Note that every Lie subgroup of O(7) is a closed 
subset of O(7) by Theorem 2.5.27 and hence is compact. 


Example 5.2.18 Let G C O(n) be a Lie subgroup and let 
g := Lie(G) = T,G 


be the Lie algebra of G. Consider the Riemannian metric on G induced by the inner 
product (5.2.20) on R”*”. Then the Riemann curvature tensor on G can be expressed 
in terms of the Lie bracket (see item (d) below). 


(a) The maps g +> ag, g > ga, gt» g’! are isometries of G for every a € G. 
(b) Let y : R ~ G be a smooth curve and X € Vect(y) be a smooth vector field 
along y. Then the covariant derivative of X is given by 


d 1 
y(t) 'VX(t) = Tio Xo + sLYO VO, 10 XO). (5.2.21) 


(Exercise: Prove equation (5.2.21). Hint: Since g C o(n) we have the identity 
trace((En + n&)C) = 0 for all €,n,f € g.) 

(c) A smooth map y : R > Gis a geodesic if and only if there exist matrices g € G 
and & € gq such that 


y(t) = gexp(té). (G22)) 


For G = O(n) we have seen this in Example 4.3.12 and in the general 
case this follows from equation (5.2.21) with X = y. Hence the exponen- 
tial map exp : gq — G defined by the exponential matrix (as in Sect. 2.5) agrees 
with the time-1-map of the geodesic flow (as in Sect. 4.3). 

(d) The Riemann curvature tensor on G is given by 


1 
g'Re(u, vw = —Jllg ug lv), gv) (5.2.23) 


for g € Gand u,v, w € T,G. Note that the first Bianchi identity is equivalent 
to the Jacobi identity. (Exercise: Prove equation (5.2.23).) 
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Definition 5.2.19 (Bi-invariant Riemannian metric) Let G be a Lie subgroup of 
GL(n, R) and let g = Lie(G) = 7G be its Lie algebra. A Riemannian metric on G 
is called bi-invariant iff it has the form 


(v,w)¢ := (vg7', wg) (5.2.24) 


for g € Gand v, w € T,G, where (-, -) is an inner product on the Lie algebra g that 
is invariant under conjugation, i.e. it satisfies the equation 


(En) = (g&g7'. gng7'). (5.2.25) 


for all €,7 € g andall g €G. 


Exercise 5.2.20 Prove that the Riemannian metric induced by (5.2.20) on any Lie 
subgroup G C O(n) is bi-invariant. 


Exercise 5.2.21 Use the Haar measure ([66, Chapter 8]) to prove that every com- 
pact Lie group admits a bi-invariant Riemannian metric. 


Exercise 5.2.22 Prove that all the assertions in Example 5.2.18 carry over verbatim 
to any Lie group equipped with a bi-invariant Riemann metric. 


Exercise 5.2.23 (Invariant inner product) Prove that, if an inner product on the 
Lie algebra g of a Lie group G is invariant under conjugation, then it satisfies the 
equation 


(1§. n].o) = (&. [n. 1) (5.2.26) 


for all &,7,¢ € g. If G is connected, prove that, conversely, equation (5.2.26) im- 
plies (5.2.25). An inner product on an arbitrary Lie algebra g is called invariant iff 
it satisfies equation (5.2.26). 


Exercise 5.2.24 (Commutant) Let g be a finite-dimensional Lie algebra. The lin- 
ear subspace spanned by all vectors of the form [&, 7] is called the commutant of g 
and is denoted by [g, g] := span{[E, 7] | &, 7 € g}. If g is equipped with an invariant 
inner product, prove that [g, g]+ = Z(q) is the center of g (Exercise 2.5.34) and 
hence g = [g, 9] ® Z(g). Prove that the Heisenberg algebra § in Exercise 2.5.15 
satisfies [h, h] = Z(h) and hence does not admit an invariant inner product. 


Example 5.2.25 (Killing form) Every finite-dimensional Lie algebra g admits a 
natural symmetric bilinear form x : g x g — R that satisfies equation (5.2.26). It is 
called the Killing form and is defined by 


K(E,n) = trace(ad(é)ad(7)), Ene gq, (5.2.27) 


where ad : g — Der(g) is the adjoint representation in Example 2.5.23. The Killing 
form may have a kernel (which always contains the center of gq), and even if it is 
nondegenerate, it may be indefinite. 
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Exercise 5.2.26 Prove that «([E, 7], ¢) = «(&, [n, €]) for all €,, € € q. 


Exercise 5.2.27 Assume that g admits an invariant inner product. For each & € g 
prove that the derivation ad(&) is skew-adjoint with respect to this inner product and 
deduce that «(&, £) = —trace(ad(&)*ad(€)) = —|ad(&)|*. Deduce that the Killing 
form is nondegenerate whenever g has a trivial center and admits an invariant inner 
product. 


Example 5.2.28 (Right invariant Riemannian metric) Let G be any Lie sub- 
group of GL(n, R) (not necessarily contained in O(7)), and let g := Lie(G) = 7);G 
be its Lie algebra. Fix any inner product on the Lie algebra g (not necessarily in- 
variant under conjugation) and consider the Riemannian metric on G defined by 
(v,w)¢ := (vg-', wg!) (5.2.28) 


for v,w € T,G. This metric is called right invariant. 


(a) The map g +> ga is an isometry of G for every a € G. 
(b) Define the linear map A : g — End(q) by 


(ACE )n, £) = 5(8 En. 81) — (1.18.80 ~ (6 Enl) (5.2.29) 
for €&,,¢ € g. Then A is the unique linear map that satisfies 


A(E)+ A—E)* =0, = A(m)E — AE)n = [E, nl 


for all €,1 € g, where A(&)* is the adjoint operator with respect to the inner 
product on g. Let y : R — G be a smooth curve and X € Vect(y) be a smooth 
vector field along y. Then the covariant derivative of X is given by 


d So = 
VX = (Gar + A(py!)Xy ‘)y (5.2.30) 
(Exercise: Prove this. Moreover, if the inner produt on g is invariant, prove 


that A(&)n = —5IE, n] for all €, 7 € g.) 
(c) A smooth curve y : R — Gis a geodesic if and only if it satisfies 


Ge ees 
ar Pao yy! =0. (5.2.31) 


(Exercise: G is complete.) 
(d) The Riemann curvature tensor on G is given by 


(Re(Eg.ng)ég)g | = (AcE, nl) + [A€é), A(n)l)é 6.222) 


for g € Gand &,n,¢ € gq. (Exercise: Prove this.) 
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5.3 Generalized Theorema Egregium 


We will now show that geodesics, covariant differentiation, parallel transport, and 
the Riemann curvature tensor are all intrinsic, i.e. they are intertwined by isometries. 
In the extrinsic setting these results are somewhat surprising since these objects 
are all defined using the second fundamental form, whereas isometries need not 
preserve the second fundamental form in any sense but only the first fundamental 
form. 

Below we shall give a formula expressing the GauBian curvature of a surface M* 
in R? in terms of the Riemann curvature tensor and the first fundamental form. It 
follows that the Gaufian curvature is also intrinsic. This fact was called by GauB 
the “Theorema Egregium” which explains the title of this section. 


5.3.1 Pushforward 


We assume throughout this section that M C R” and M’ C R” are smooth sub- 
manifolds of the same dimension m. As in Sect. 5.1 we denote objects on M' by 
the same letters as objects in M with primes affixed. In particular, g’ denotes the 
first fundamental form on M’' and R’ denotes the Riemann curvature tensor on M’. 

Let 6: M > M' be a diffeomorphism. Using ¢ we can move objects on M to 
M". For example the pushforward of a smooth curve y : J > M is the curve 


dy = boy: ITM’, 
the pushforward of a smooth function f : M — R is the function 
dbf i= fog!: MOR, 


the pushforward of a vector field X € Vect(y) along a curve y: J > M is the 
vector field ¢, X € Vect(¢,y) defined by 


(GX) (t) = do(y())X() 


for t € 7, and the pushforward of a global vector field X € Vect(M) is the vector 
field @, X € Vect(M’) defined by 


(P.X )(P(p)) = do (p)X(p) 


for p € M. Recall that the first fundamental form on M is the Riemannian metric g 
defined as the restriction of the Euclidean inner product on the ambient space to each 
tangent space of M. It assigns to each p € M the bilinear map g,:7,M xT,M +R 
given by 


gp(u,v) = (u,v), u,v €T,M. 
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Its pushforward is the Riemannian metric which assigns to each p’ € M’ the inner 
product (@.2)p’ : TM’ x Ty M' — R defined by 


($+ 8)o(p) (dG (p)u, db(p)v) := gp(u, v) 


for p:= 6"'(p') € M and u, v € T,M. The pushforward of the Riemann curvature 
tensor is the tensor which assigns to each p’ € M’ the bilinear map (@, R))»’ : Tp) M'x 
Ty M' > L£(TyM', TyM’), defined by 


($.R)o(p) (db(p)u, db(p)v) := do(p)Rp(u, v)do(p) | 


for p:= @'(p')€ M andu,v € T,M. 


5.3.2 Theorema Egregium 


Theorem 5.3.1 (Theorema Egregium) The first fundamental form, covariant dif- 
ferentiation, geodesics, parallel transport, and the Riemann curvature tensor are 
intrinsic. This means that for every isometry 6 : M — M' the following holds. 


(i) beg = 8’. 
(i) If X € Vect(y) is a vector field along a smooth curve y : I > M, then 


V'(¢4X) = OVX, (5.3.1) 
and if X,Y € Vect(M) are global vector fields, then 
V5.x Px Y = ,(VxY). (5.3.2) 


(iii) [fy : I > M isa geodesic, thendoy: I > M' isa geodesic. 
(iv) Ify : I > M is asmooth curve, then for all s,t € I, we have 


®',.,(t,8)db(y(s)) = dot) ® (1,5). (5.3.3) 
(v) & R= R’. 


Proof Assertion (i) is simply a restatement of Theorem 5.1.1. To prove (ii) we 
choose a local smooth parametrization yy : 82 > U of an open set U C M, defined 
on an open set 2 CR”, so that y~!: U > Q is acoordinate chart. Suppose without 
loss of generality that y(t) € U for allt € J and define c : J > 2 andé: J > R”™ 
by 


m 


. 0 
WO=WeO), — XO= KOM Cw), 
i=1 


240 5 Curvature 


Recall from equations (3.6.6) and (3.6.7) that 


m . m ; : P) 
Vx) = [FOF DL Hee oo | Tew), 


k=1 ij=l 


where the Christoffel symbols ie : 2 — R are defined by 


ew m ‘ ow 
a) Oxiaxi Ds Ty axk 


Now consider the same formula for ¢, X using the parametrization 
W:=g¢0f: 2 >U':=d¢U)CM’. 


The Christoffel symbols I”’ : : 2 — R associated to this parametrization of U’ are 


defined by the same formula as the ry with w replaced by w’. But the metric tensor 
for y agrees with the metric tensor for wp’: 


(™ | (~ =| 
8ij = = 


Oxi? Ox/ Oxi? Axi 


Hence it follows from Lemma 3.6.5 that I”’ : = rk for alli, j,k. This implies that 
the covariant derivative of ¢,,X is given by 


m . m os Pade 
Vex =D (H+ DO ode | 


k=l ij=l 


(c) 


m 


m : ; . 0 
= db(w(c)) >) | & + Yo OF ee! aac) 
k=1 i,j=l 


= $,VX. 
This proves (5.3.1). Equation (5.3.2) follows immediately from (5.3.1) and Re- 
mark 5.2.4. 
Here is a second proof of (ii). For every vector field X € Vect(M) we define the 
operator Dy : Vect(M) — Vect(M) by 
DyY := $*(Vo,x«Y). 
Then, for all X,Y € Vect(M), we have 


DyX — DxY = $*(Vp,vGeX — Vp,xbx¥) = $*[heX, OY] = [X, ¥].- 
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Moreover, it follows from (i) that 


Pr Lx (VY, Z) = L4,x (bY, OZ) 
= (Vox PY, bsZ) + (PY, Vox bs Z) 
= ($,DxY,b.Z) + (bx¥, bx Dx Z) 
= ¢,((DxY, Z) + (¥, DxZ)). 


and hence Ly (Y, Z) = (Dy Y, Z) + (Y, Dy Z) for all X, Y, Z € Vect(M ). Thus the 
operator X +» Dy satisfies equations (5.2.9) and (5.2.10) and, by Lemma 5.2.7, it 
follows that Dy Y = Vy Y for all X, Y € Vect(M). This completes the second proof 
of (ii). 

We prove (iii). Since ¢@ preserves the first fundamental form it also preserves the 
energy of curves, namely 


E(poy) = Ely) 


for every smooth map y : [0, 1] ~ M. Hence y is a critical point of the energy func- 
tional if and only if ¢ 9 y is a critical point of the energy functional. Alternatively it 
follows from (ii) that 


f(a ee ee 


for every smooth curve y : J > M. If y is a geodesic, the last term vanishes and 
hence ¢ 0 y is a geodesic as well. As a third proof we can deduce (iii) from the 
formula ¢ (exp, (v)) = expy:,)(db(p)v) in the proof of Theorem 5.1.1. 

We prove (iv). For fg € J and vg € Ty))M define 


X(t) = P(t, t0)v0, X(t) = Dyoy (t, to) db(y (to))v0- 


By (ii) the vector fields X’ and ¢, X along ¢ 0 y are both parallel and they agree at 
t = to. Hence X’(t) = ¢, X(t) for all t € J and this proves (5.3.3). 

We prove (v). Fix a smooth map y : R? > M anda smooth vector field Z along 
y, and define 


y=goy:R? SM’, Z':=$,Z € Vect(y’). 
Then it follows from (ii) that 


RO by)Z! =VVIZ =z 
= $,(V;V,;Z — V;V;Z) 
= ddb(y)ROsy, 0:y)Z 
= (Px R)Osy’, O:y')Z’. 


This proves (v) and Theorem 5.3.1. Oo 
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The assertions of Theorem 5.3.1 carry over in slightly modified form to local 
isometries ¢ : M — M". In particular, the pushforward of a vector field on M un- 
der ¢ is only defined when ¢ is a diffeomorphism while the pushforward of a vector 
field along a curve is defined for any smooth map ¢. Also, the pushforward of the 
Riemann curvature tensor under a local isometry is only defined locally, and local 
isometries satisfy the local analogue of the first assertion in Theorema Egregium by 
definition. 


Corollary 5.3.2 (Theorema Egregium for Local Isometries) Every local isom- 
etry @: M > M‘ has the following properties. 


(i) Every vector field X along a smooth curve y : I — M satisfies (5.3.1). 
Gi) Ify: I > M isa geodesic, then soisgpoy:I > M'. 

(iii) Parallel transport along a smooth curve y : I — M satisfies (5.3.3). 
(iv) The curvature tensors R of M and R' of M' are related by the formula 


Rip) (db(p)u, db(p)v) = db(p) Rp(u, v)de(p) | (5.3.4) 
forall p € M andallu,v € T,M. 


Proof Let po € M. Then, by the Inverse Function Theorem 2.2.17, there exists an 
open neighborhood U C M of po such that U’ := @(U) is an open subset of M’ 
and the restriction ¢|y : U — U’ is a diffeomorphism. This restriction is an isom- 
etry by Theorem 5.1.1. Hence, by Theorem 5.3.1 the assertions (i) and (ii) hold for 
the restriction of y to Jp := ames to ) (a union of subintervals of 7) and (iv) holds 
for all p € U. Since these are local statements and po was chosen arbitrary, this 
proves (1), (ii), and (iv). Part (iii) follows directly from (i) as in the proof of Theo- 
rem 5.3.1 and this proves Corollary 5.3.2. oO 


The next corollary spells out a useful consequence of Corollary 5.3.2. For suffi- 
ciently small tangent vectors equation (5.3.5) below already appeared in the proof 
of Theorem 5.1.1 and was used in Lemma 5.1.10 and Exercise 5.1.11. When M@ 
is not complete, recall the notation V, C 7M for the domain of the exponential 
map of M at a point p (Definition 4.3.5). For p’ € M’ denote the domain of the 
exponential map by V,, C TM’. 


Corollary 5.3.3. Let ¢: M — M' be a local isometry and let p € M. Then 
dp(p)Vp C V5(,) and, for every v € Vp, 


$(exp,(v)) = exply.,) (db(p)v). (5.3.5) 
Proof Let v € V, C T,M and define y(t) := exp,(tv) for 0 <r < 1. Then 
y: [0,1] > M is a geodesic by Lemma 4.3.6, and hence so is the curve y’:= doy: 
[0, 1] > M' by Corollary 5.3.2. Moreover, 


y'0) = (70) =o), ¥'() = db(v(0)) ¥0) = db(p)v 
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by the chain rule. Hence it follows from the definition of the exponential map (Def- 


inition 4.3.5) that do(p)v € V,v,) and 


EXP) (dP(p)v) = y'(1) = $(y(1)) = b(exp, (v)). 


This proves Corollary 5.3.3. O 


5.3.3 Gauian Curvature 


As a special case we shall now consider a hypersurface M C R’t!, i.e. a smooth 
submanifold of codimension one. We assume that there exists a smooth map v : 
M -> R’*! such that, for every p € M, we have 


v(p) L T,M, |v(p)| = 1. 


Such a map always exists locally (see Example 3.1.3). Note that v(p) is an element 
of the unit sphere in Rt! for every p € M and hence we can regard v as a map 
from M to S’”, i.e. v : M — S’. Such a map is called a Gauf map for MW. Note 
that if v : M — S’ is a GauB map, so is —v, but this is the only ambiguity when 
M is connected. Differentiating v at p € M we obtain a linear map 


dv(p):T,M > Typ S" =T,M 


Here we use the fact that T,(p))S” = v( p)* and, by definition of the Gau8 map v, 
the tangent space of M at p is also equal to v(p)*+. Thus dv(p) is a linear map 
from the tangent space of M at p to itself. 


Definition 5.3.4 The GauBian curvature of the hypersurface M is the real valued 
function K : M — R defined by 


K(p) := det(dv(p) : T,M — T,M) 


for p € M. (Replacing v by —v has the effect of replacing K by (—1)”K; so K is 
independent of the choice of the Gaus map when 7 is even.) 


Remark 5.3.5 Given a subset B C M, the set v(B) C S” is often called the 
spherical image of B. If v is a diffeomorphism on a neighborhood of B, the change 
of variables formula for an integral gives 


[vs = [Kim 
B 


v(B) 
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Here jy and jzs denote the volume elements on M and S”, respectively. Introduc- 
ing the notation Areay(B) := /, 3 em We obtain the formula 


IK(p)| = fim Sas) 
Bop Ar cay (B) 

This says that the curvature at p is roughly the ratio of the (m-dimensional) area of 

the spherical image v(B) to the area of B where B is a very small open neighbor- 

hood of p in M. The sign of K(p) is positive when the linear map dv(p) : T, M > 

T,M preserves orientation and negative when it reverses orientation. 


Remark 5.3.6 We see that the GauBian curvature is a natural generalization of 
Euler’s curvature for a plane curve. Indeed if M C R? is a 1-manifold and p € M, 
we can choose a curve y = (x, y) : (—é, €) > M such that y(0) = p and |p(s)| = 1 
for every s. This curve parametrizes M by the arclength and the unit normal vector 
pointing to the right with respect to the orientation of y is v(x, y) = (y, —x). This 
is a local Gau8 map and its derivative (j}, —X) is tangent to the curve. The inner 
product of the latter with the unit tangent vector y = (x, y) is the GauBian curvature. 
Thus 


__ ax d’y dyd?x _ dé 
“ds ds? ds ds? ds 


where s is the arclength parameter and @ is the angle made by the normal (or the 
tangent) with some constant line. With this convention K is positive at a left turn 
and negative at a right turn. 


Exercise 5.3.7 The Gaufian curvature of an m-dimensional sphere of radius r 
is constant and has the value r~” (with respect to an outward pointing Gau8 map 
when 7m is odd). 


Exercise 5.3.8 Show that the Gaufian curvature of the surface z = x? — y* is —4 
at the origin. 


We now restrict to the case of surfaces, i.e. of 2-dimensional submanifolds 
of R>. Figure 5.1 illustrates the difference between positive and negative GauBian 
curvature in dimension two. 


Fig. 5.1 Positive and nega- 


tive GauBian curvature ——_ =] 
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Theorem 5.3.9 (Gaufian curvature) Let M C R? be a surface and fix a point 
p«€M. Ifu,v €T,M isa basis, then 


(R(u, v)v,u) 


KP) = Tee = 0? 


(5.3.6) 


Moreover, 
R(u,v)w = —K(p)(v(p), u X v) v(p) x w 63:7) 
forallu,v,w €T,M. 


Proof The orthogonal projection of R? onto the tangent space 7,M = v(p)+ is 
given by the 3 x 3-matrix 


II(p) = 1— v(p)v(p)". 
Hence 
dII(p)u = —v(p)(dv(p)u)' — (dv(p)u)v(p)". 


Here the first summand is the second fundamental form, which maps 7,M to 
T,M ae and the second summand is its dual, which maps T, M + to T,M. Thus 


= 
hy(v) = —v(p)(dv(p)v) :T,M—> On daa 
hy(u)* = —(dv(p)u)v(p)' : T, M+ > T,M. 

By the Gau8—Codazzi formula this implies 


R,(u, v)w = hp(u)*hp(v)w — hy(v)*h,(u)w 


a (dv(p)u) (dv(p)v)'w - (dv(p)v) (dv(p)u)'w 
= (dv(p)v, w)dv(p)u — (dv(p)u, w)dv(p)v 


and hence 
(R,(u, v)w,z) = (dv(p)u, z)(dv(p)u, w) — (dv(p)u, w)(dv(p)v,z). (5.3.8) 


Now fix four tangent vectors u,v, w,Z € T,M and consider the composition 


R? “, R34, R? S R3 
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of the linear maps 

AE := E!v(p) + Bu t+ Ev, 
dv(p)n, ifn 1 v(p), 
0 ifn € Rv(p), 


(¢, v(p)) 
(¢, 2) 
(¢, w) 


This composition is represented by the matrix 


Bni= 


Cl: 


1 0 0 
CBA= |0 (dv(p)u,z) (dv(p)v,z) 
0 (dv(p)u,w) (dv(p)v, w) 
Hence, by (5.3.8), we have 
(R,(u, v)w,z) = det(CBA) 
= det(A) det(B) det(C) 
= (v(p),u x v) K(p){v(p),z x w) 
= —K(p)(v(p),u x v)(v(p) x w, 2). 
This implies (5.3.7) and 
(Rp(u,v)v,u) = K(p){v(p), u x v)? 
= K(p)|u x v|? 
= K(p)(Iul?loP? = (u,v)?). 
This proves Theorem 5.3.9. O 


Remark 5.3.10 Equation (5.3.6) implies 


(Rp(u, v)w,z) = K(p)((u, 2)(v, w) = (u, w)(v,2)) (5.3.9) 
for all p € M and all u,v,w,z € T,M. This is proved in Theorem 6.4.8 below. 
Exercise: Deduce this formula from (5.3.7). 

Corollary 5.3.11 (Theorema Egregium of GauB) The Gaufian curvature is in- 
trinsic, i.e. if 

¢:M— M' 
is an isometry of surfaces in R?, then 


K=K'o@:M-—R. 


Proof Theorem 5.3.1 and Theorem 5.3.9. O 
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Exercise 5.3.12 Form = 1 the Gau®ian curvature is clearly not intrinsic as any two 
curves are locally isometric (parameterized by arclength). Show that the curvature 
K(p) is intrinsic for even m while its absolute value |K(p)| is intrinsic for odd 
m > 3. Hint: We still have the equation (5.3.8) which, for z = u and v = w, can 
be written in the form 


(dv(p)u,u) oo) 


CD eee (dv(p)v, ») 


Thus, for every orthonormal basis v1, ..., Um of TM, the 2 x 2 minors of the matrix 

are intrinsic. Hence everything reduces to the following assertion. 

Lemma The determinant of anm Xm matrix is an expression in its 2 x 2 minors if 

m is even; the absolute value of the determinant is an expression in the 2 x 2 minors 

ifm is odd and greater than or equal to 3. 

The lemma is proved by induction on m. For the absolute value, note the formula 
det(A)” = det(det(A)1,,) = det(AB) = det(A) det(B) 


for an m x m-matrix A where B is the transposed matrix of cofactors. 


5.4 Curvature in Local Coordinates* 
5.4.1 Riemann 


Let M C R* be an m-dimensional manifold and let ¢ = y~! : U > 2 bea local 
coordinate chart on an open set U C M with values in an open set 2 C R’”. Define 
the vector fields F,,..., E,, along y by 


0 
i (x) E Ty(x)M. 


E; (x) = aad 


These vector fields form a basis of Ty(,)M for every x € 2 and the coeffi- 
cients gj; : 82 — R of the first fundamental form are gj; = (E iF i). Recall from 
Lemma 3.6.5 that the Christoffel DK : 22 — R are the coefficients of the Levi-Civita 
connection, defined by 


m 


ViEj = DTG EE 
k=1 
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and that they are given by the formula 


m 


= 268" ~ (3: gj0 + 9)gie — 808). 


Define the coefficients ied , 1 & > Rand Rijx¢ : 2 — R of the Riemann curvature 
tensor by 


m 


R(E;, E;)Ex = ‘> Ri, Es, (5.4.1) 
Rijxe = (R(Ej, Ej) Ex, Ec) = 5 Ri Sve (5.4.2) 

These coefficients are given by 
Roig = OT — OT + > (ni Arh — 06M). (5.4.3) 


The coefficients of the Riemann curvature tensor have the symmetries 
Rijce = —Ryixe = —Rijee = Recij (5.4.4) 
and the first Bianchi identity has the form 
Rik + Rei + Rij = 0, Rijke + Rjeie + Reijge = 0. (5.4.5) 


Warning: Care must be taken with the ordering of the indices. Some authors use 
the notation Rij for what we call Rie and Rex;; for what we call Rjjxe. 


Exercise 5.4.1 Prove equations (5.4.3), (5.4.4), and (5.4.5). Use (5.4.3) to give an 
alternative proof of Theorem 5.3.1. 


5.4.2 GauB 


If M C R" is a 2-manifold (not necessarily embedded in R*), we can use equa- 
tion (5.3.6) as the definition of the GauBian curvature K:M—>R.Letw:2—-U 
be a local parametrization of an open set U C M defined on an open set 2 C R?. 
Denote the coordinates in R? by (x, y) and define the functions E, F,G : 2 > R 
by 


=|a0?, F:= (adv), G:=|dv/. 
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We abbreviate D := EG — F?. Then the composition of the Gauian curvature 
K: M — R with the parametrization yw is given by 


‘ E F dy F — 30,.G 
Koy=- det] F G 59,G 
i 1 
50,E 0,F —j50,E —30E + 0,0,F —58G 
i E F  5d,E 
—~— det] F G  40.G 
S0jE 50,G 0 
1 o (Se) 


~~ 94/D Ox BD 
4: 1 {| 
2./D dy EJ/D 


re 0 (Rone) 1 — 
——-24/D ax JD 2/D dy JD 
: E aE dE 
—~—det] F a:F 0)F 
2 x J 
sie G aG a,G 


This expression simplifies dramatically when F = 0 and we get 


1 ( axG_ 8 if) 
2VEG\0x JEG dy JEG) 


Exercise 5.4.2 Prove that the Riemannian metric 


Kow= (5.4.6) 


4 


E=G6=—2.—»: 
Ck ee a 


F=0, 


on R? has constant constant curvature K = 1 and the Riemannian metric 


4 


E= = ———————  _______ 
G (1 — x2 — y2)2’ 


F=0, 


on the open unit disc has constant curvature K = —1. 


Check for 
updates 


Geometry and Topology 


In this chapter we address what might be called the “fundamental problem of intrin- 
sic differential geometry”: when are two manifolds isometric? The central tool for 
addressing this question is the Cartan—Ambrose—Hicks Theorem (Sect. 6.1). In the 
subsequent sections we will use this result to examine flat spaces (Sect. 6.2), sym- 
metric spaces (Sect. 6.3), and constant sectional curvature manifolds (Sect. 6.4). 
The chapter then examines manifolds of nonpositive sectional curvature and in- 
cludes a proof of the Cartan Fixed Point Theorem (Sect. 6.5). The last three sections 
introduce the Ricci tensor and show that complete manifolds with uniformly posi- 
tive Ricci tensor are compact (Sect. 6.6) and discuss the scalar curvature (Sect. 6.7) 
and the Weyl tensor (Sect. 6.8). 


6.1 The Cartan—Ambrose-Hicks Theorem 


The Cartan—Ambrose—Hicks Theorem answers the question (at least locally) when 
two manifolds are isometric. In general the equivalent conditions given there are 
probably more difficult to verify in most examples than the condition that there 
exist an isometry. However, under additional assumptions it has many important 
consequences. The section starts with some basic observations about homotopy and 
simple connectivity. 


6.1.1 Homotopy 


Definition 6.1.1 Let M be a manifold and let J = [a, b] be a compact interval. A 
(smooth) homotopy of maps from J to M is asmooth map y : [0,1] x J ~ M. We 
often write y,(t) = y(A, t) for A € [0, 1] and ¢ € J and call y a (smooth) homotopy 
between yo and y;. We say the homotopy has fixed endpoints if y,(a@) = yo(a) 
and yi (b) = yo(b) for all A € [0, 1]. (See Fig. 6.1.) 
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Fig. 6.1 A homotopy with 


fixed endpoints VA 


We remark that a homotopy and a variation are essentially the same thing, 
namely a curve of maps (curves). The difference is pedagogical. We used the word 
“variation” to describe a curve of maps through a given map; when we use this 
word we are going to differentiate the curve to find a tangent vector (field) to the 
given map. The word “homotopy” is used to describe a curve joining two maps; it 
is a global rather than a local (infinitesimal) concept. 


Definition 6.1.2 A manifold M is called simply connected iff for any two 
curves Yo, 71 : [a,b] > M with yo(a) = y;(a) and yo(b) = y,(b) there exists a 
homotopy from yo to y; with endpoints fixed. (The idea is that the space 2,,, of 
curves from p to q is connected.) 


Remark 6.1.3. Two smooth maps yo, 71 : [a,b] > M with the same endpoints can 
be joined by a continuous homotopy if and only if they can be joined by a smooth 
homotopy. This follows from the Weierstrass approximation theorem. 


Remark 6.1.4 Assume M is a connected smooth manifold. Then the topological 
space §2, of all smooth curves in M with the endpoints p and gq is connected for 
some pair of points p,q € M if and only if it is connected for every pair of points 


P.q € M. (Prove this!) 


Example 6.1.5 The Euclidean space R” is simply connected; any two curves 
Yo, 1: [a, b] > R” with the same endpoints can be joined by the homotopy y, (ft) := 
yo(t) + A(vi(t) — yo(t)). 


Example 6.1.6 The punctured plane C \ {0} is not simply connected; two curves 
of the form 


Yn(t) = eorint 0 < t < 1, 
are not homotopic with fixed endpoints for distinct integers n. 


Exercise 6.1.7 Prove that the m-sphere S”” is simply connected for m # 1. 
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6.1.2. The Global C-A-H Theorem 


Theorem 6.1.8 (Global C-A-H Theorem) Let M CR" and M’ CR” be 
nonempty, connected, simply connected, complete m-manifolds. Fix two elements 
Po € M and py € M' and let ®y : T,,.M — T,, M' be an orthogonal linear iso- 
morphism. Then the following are equivalent. 


(i) There exists an isometry @ : M — M' satisfying 
$(Po) = Pp. (Po) = Po. (6.1.1) 
(ii) If (®, y, y') is a development satisfying the initial condition 
yO)= po, yvO=p~, (0) =o, (6.1.2) 
then 
yYD)=po = y'(1) = po, PU) = Go 
(iii) If (o, yo. v4) and (®1, v1, y;) are developments satisfying (6.1.2), then 
yo) = 1) = yp) = y; (1). 
(iv) If (®, y, y’) is a development satisfying (6.1.2), then ®, Ry = Rj, 


Example 6.1.9 Before giving the proof let us interpret the conditions in case 
M and M’ are two-dimensional spheres of radius r and r’ respectively in three- 
dimensional Euclidean space R3. Imagine that the spheres are tangent at po = Po: 
Clearly the spheres will be isometric exactly when r = r’. 

Condition (ii) says that if the spheres are rolled along one another without sliding 
or twisting, then the endpoint y’(1) of one curve of contact depends only on the 
endpoint y(1) of the other and not on the intervening curve y(t). This condition is 
violated in the case r 4 r’ (see Fig. 6.2). 

By Theorem 5.3.9 the Riemann curvature of a 2-manifold at p is determined by 
the GauBian curvature K(p); and for spheres we have K(p) = 1/r?. 


Fig. 6.2 Diagram for Exam- M 
ple 6.1.9 Sy, 
(wn 
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Exercise 6.1.10 Let y be the closed curve which bounds an octant as shown in the 
diagram for Example 6.1.9. Find y’. 


Exercise 6.1.11 Show that in case M is two-dimensional, the condition (1) = Bp 
in Theorem 6.1.8 may be dropped from (ii). 


Lemma 6.1.12 Let 6: M — M' be a local isometry and let y: I + M bea 
smooth curve on an interval I. Fix an element to € I and define 


Po:= Y(t),  qo:= (po), Po := dh(po). (6.1.3) 


Then there exists a unique development (®, y, y') of M along M' on the entire 
interval I satisfying the initial conditions 


y' (to) = qo, P(to) = Do. (6.1.4) 
This development is given by 
YO=o70), Pt) =dd(v@) (6.1.5) 
fort el. 


Proof Define 


Y@):=b67M), 0) := dd(v()) 


fort € J. Then y’(t) = &(t) p(t) for all t € J by the chain rule, and every vector 
field X along y satisfies 6VX = V’(@X) by Corollary 5.3.2. Hence (9, y, y’) is 
a development by Lemma 3.5.19. By (6.1.3) this development satisfies the initial 
condition (6.1.4). Hence the assertion follows from the uniqueness result for devel- 
opments in Theorem 3.5.21. This proves Lemma 6.1.12. O 


Proof of Theorem 6.1.8 We first prove a slightly different theorem. Namely, we 
weaken condition (i) to assert that @ is a local isometry (i.e. not necessarily bi- 
jective), and prove that this weaker condition is equivalent to (ii), (ii), and (iv) 
whenever M is connected and simply connected and M’ is complete. Thus we drop 
the hypotheses that M be complete and M’ be connected and simply connected. 

We prove that (i) implies (ii). Given a development as in (ii) we have, by 
Lemma 6.1.12, 


y'() = 67) = b(Po) = Po, = PU) = d(y(1)) = 46 (p0) = ®o, 


as required. 

We prove that (ii) implies (iii) when M’ is complete. Choose developments 
(®;, y;, y;) for i = 0, 1 as in (iii). Define a curve y : [0, 1] + M by “composition”, 
16; 


yo(2t), 0<t<1/2, 


t):= 
VO =) (2-20), 1/2<t <1, 
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so that y is continuous and piecewise smooth and y(1) = po. By Theorem 3.5.21 
there exists a development (9, y, y’) on the interval [0, 1] satisfying (6.1.2) (because 
M’ is complete). Since y(1) = po it follows from (ii) that y’(1) = pg and (1) = Do. 
By the uniqueness of developments and the invariance under reparametrization, we 
have 


(@(1), 0), pO) = {M2 %2D. KAD). O<1<1/2, 
(® (2 — 21), vi(2— 2t), y;(2—2t)), 1/2<t<1. 
Hence yj(1) = y’(1/2) = y; (1) as required. 

We prove that (iii) implies (i) when M’ is complete and M is connected. Define 
the map ¢ : M — M’ as follows. Fix an element p € M. Since M is connected, 
there exists a smooth curve y : [0,1] — M such that y(0) = po and y(1) = p. 
Since M’ is complete, there exists a development (%,y,y’) with y’(0) = po 
and ®(0) = @p (Theorem 3.5.21). Now define ¢(p) := y’(1). By (iii) the end- 
point p’:= y’(1) is independent of the choice of the curve y, and so @ is 
well-defined. We prove that this map ¢ satisfies the following 


(a) If (®, y, y’) is a development satisfying y(O) = po, y'(0) = po, (0) = po, 
then o(y(t)) = y(t) for0 <t <1. 

(b) If p.q € M satisfy 0 < d(p,q) < inj(p; M) and d(p,q) < inj(¢(p): M’), 
then d'(b(p). 6(q)) = a(p. 9). 


That @ satisfies (a) follows directly from the definition and the fact that the 
triple (®,, y,,y;) defined by ®,(s) := ®(st), y,(s) := y(st), yvi(s) = y'(st) 
for 0 < s < 1 isa development. To prove (b), choose v € 7M such that 


|u| = d(p,q) exp, (v) = 


(Theorem 4.4.4) and let y : [0, 1] ~ M be a smooth curve with 
y(0) = po, y(t) = exp, ((2t — 1)v) 


for 5 <t <1. Let (@, y, y’) be the unique development of M along M’ satisfy- 
ing y’(0) = po and (0) = ®p (Theorem 3.5.21). Then, by (a), 


Y=), y'd)=¢@). 


Also, by part (ii) of Lemma 3.5.19 with X = y, the restriction of y’ to the in- 
terval [}. 1] is a geodesic. Thus y'(t) = expg,,)((2¢ — I)v’) for 5 <t <1, where 
the tangent vector v’ € Typ) M’ is given by v' := (5) = D(5)v and hence sat- 
isfies |v'| = |v| = d(p,q) < inj(¢(p); M’). Thus it follows from Theorem 4.4.4 


that d'($(p). 6(q)) = d'(P(D), exPiy,) (v')) = |v'| = d(p,q) and this proves (b). 
It follows from (b) and Theorem 5.1.1 that ¢ is a local isometry. 
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We prove that (i) implies (iv). Given a development as in (ii) we have 


YO=¢7M), Pt) =de(y@) 


for every t, by Lemma 6.1.12. Hence it follows from part (iv) of Corollary 5.3.2 
(Theorema Egregium for local isometries) that 


P(t), Ry) = (¢..R) = Ry) 
for all ¢ as required. 
We prove that (iv) implies (iii) when M’ is complete and M is simply con- 
nected. Choose developments (®;, y;, y;) for i = 0, 1 as in (iii). Since M is simply 
connected there exists a homotopy 


[0,1] x [0,1] >M: A,r yA,t) = y(t) 


from yo to y; with endpoints fixed. By Theorem 3.5.21 there is, for each A, a devel- 
opment (®,, y,, y;,) on the interval [0, 1] with initial conditions 


¥,(0) = po, Pi (0) = Bo 


(because M’ is complete). The proof of Theorem 3.5.21 also shows that yj (t) and 
®)(t) depend smoothly on both ¢ and A. We must prove that 


yi() = yo(1). 
To see this we will show that, for each fixed ft, the curve 
Ar (®,(t), (0). 40) 


is a development; then by the definition of development we have that the curve 
A+ y} (1) is smooth and 


dy,0) = &(1)day,(1) = 0 


as required. 

First choose a basis e],..., €m Of T,,M and extend it to obtain vector fields 
E; € Vect(y) along the homotopy y by imposing the conditions that the vector 
fields t > E;(A,t) be parallel, i.e. 


V,Ei(A,t)=0, £;(A,0) =e;. (6.1.6) 


Then the vectors £)(A,t),...£m(A,t) form a basis of T,,~)M for all A and f. 
Second, define the vector fields E/ along y’ by 


El(A,t) = ®,(t)E;(A,t) (6.1.7) 
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so that V/ E; = 0. Third, define the functions €!,...,&’” : [0, 1]? > R by 


m 


Oy= > EF, day = > EE). (6.1.8) 
i=1 i=1 


Here the second equation follows from (6.1.7) and the fact that 6,0,y = 0;y’. 
Now consider the vector fields 


X= 077", Y/ := WE; (6.1.9) 


along y’. They satisfy the equations 


i=] 


i=l 
and 


VY) = VV, E; — ViV,E; 
= R'(d,y', dy’) E;. 


To sum up we have X'(A,0) = Y/(A,0) = 0 and 
VX’ = Do (a,8' E; + 'Y/), VaR Oy 2. (6.1.10) 


i=l 
On the other hand, the vector fields 
x’ = DO, Y/ = DV E; (6.1.11) 


along y’ satisfy the same equations, namely 


VX! = QV,dy = BV. = PV S rs) 
i=1 


= ©, 5° (a8 E; + VE) = > (0,8 E; + €'Y/), 
i=1 i=, 
ViY/ = ®(V,ViE; — ViVi Ei) = ®,. R(0ry, Ory) E; 
= R'(®,0;y. D0, 7) PD, E; = R'(0,y', X')E;. 


Here the last but one equation follows from (iv). 
Since the tuples (6.1.9) and (6.1.11) satisfy the same differential equation (6.1.10) 
and vanish at t = 0 they must agree. Hence 


day’ = Didzy, VE; = ®VE; 
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fori = 1,...,m. This says that A +> (®,(¢), y(t), vy, (¢)) is a development. For 
t = 1 we obtain 0, y’(A, 1) = 0 as required. 

Now the modified theorem (where ¢ is a local isometry) is proved. The original 
theorem follows immediately. Condition (iv) is symmetric in M and M’. Thus, if 
we assume (iv), there are local isometries @ : M — M’ andy : M’ > M satisfying 
$(Po) = Ph. db(po) = Go and W(pi,) = po. dW(p)) = %". But then yo isa 
local isometry with yw o }(po) = po and d(v o })(po) = id. Hence y o © is the 
identity. Similarly ¢ o w is the identity so ¢ is bijective (and w = ¢~!) as required. 
This proves Theorem 6.1.8. O 


Remark 6.1.13 The proof of Theorem 6.1.8 shows that the various implications 
in the weak version of the theorem (where ¢ is only a local isometry) require the 
following conditions on M and M’: 


(i) always implies (ii), (iii), and (iv); 

(ii) implies (iii) whenever M’ is complete; 

(iii) implies (i) whenever M’ is complete and M is connected; 

(iv) implies (iii) whenever M’ is complete and M is simply connected. 


Remark 6.1.14 The proof that (iii) implies (i) in Theorem 6.1.8 can be slightly 
shortened by using the following observation. Let ¢ : M — M’ be a map be- 
tween smooth manifolds. Assume that @ © y is smooth for every smooth curve 
y : [0,1] > M. Then ¢ is smooth. 


Corollary 6.1.15 Let M and M' be nonempty, connected, simply connected, com- 
plete Riemannian manifolds and let ¢ : M — M' be a local isometry. Then @ is 
bijective and hence is an isometry. 


Proof This follows by combining the weak and strong versions of the global 
C-A-H Theorem 6.1.8. Let po € M and define pp) := (po) and Do := dd(po). 
Then the tuple M, M’, po, pj, Po satisfies condition (i) of the weak version of The- 
orem 6.1.8. Hence this tuple also satisfies condition (iv) of Theorem 6.1.8. Since M 
and M’ are connected, simply connected, and complete we may apply the strong 
version of Theorem 6.1.8 to obtain an isometry yy: M — M' satisfying W(po) = pp 
and dy (po) = ®o. Since every isometry is also a local isometry and M is connected 
it follows from Lemma 5.1.10 that 6(p) = w(p) for all p €¢ M. Hence ¢ is an isom- 
etry, as required. Oo 


Remark 6.1.16 Refining the argument in the proof of Corollary 6.1.15 one can 
show that a local isometry ¢ : M — M’ must be surjective whenever M is com- 
plete and M’ is connected. None of these assumptions can be removed. (Take an 
isometric embedding of a disc in the plane or an embedding of a complete space M 
into a space with two components, one of which is isometric to M.) 

Likewise, one can show that a local isometry ¢ : M — M’ must be injective 
whenever M is complete and connected and M’ is simply connected. Again none 


6.1 The Cartan—Ambrose-Hicks Theorem 259 


of these asumptions can be removed. (Take a covering R — S', or a covering of a 
disjoint union of two isometric complete simply connected spaces onto one copy of 
this space, or some noninjective immersion of a disc into the plane and choose the 
pullback metric on the disc.) 
6.1.3 The Local C-A-H Theorem 
Theorem 6.1.17 (Local C-A-H Theorem) Let M and M’ be smooth m- 
manifolds, let pp € M and po € M', and let ®y : T,,M — TM’ be an orthogonal 
linear isomorphism. Let r > 0 be smaller than the injectvity radii of M at po and 
of M' at po and define 

U,:={p€M|d(po,p)<r}, Up := tp! = M'|d'(pp, p’) <r}. 
Then the following are equivalent. 
(i) There exists an isometry $ : U, — U; satisfying (6.1.1). 
(ii) If (®, y, y’) is a development on an interval I C R with 0 € I, satisfying the 

initial condition (6.1.2) as well as y(I) C U, and y'(I) C Uy, then 

yQ) = Po = y'(1) = po, (1) = ®. 
(iii) If (®o, yo, yy) and (®i, v1, y;) are developments as in (ii), then 
yol)= 0) = yo(1) = yj (1). 
(iv) [fv € T,,M with |v| < r and 
y(t) := exp, (tv), y(t) = exp, (tov), P(t) = D,,(t,0) PoP, (0, t), 

then ®(t). Ry) = R) a) for 9 <t<l. 

If these equivalent conditions are satisfied, then 
(EXP), (0)) = exPly (Pov) 

forallv €T,.M with |v| <r. 


The proof is based on the following lemma. 


Lemma 6.1.18 Let p ¢ M andv,w €T,M. For0 <t < | define 


a 
y(t) := exp(tv), X(t) = aA Poo + Aw)) € T,a)M. 
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Then 
V,V,X + R(X, y)y = 9, X(0) = 0, V,X(0) = w. (6.1.12) 


A vector field along y satisfying the first equation in (6.1.12) is called a Jacobi field 
along y. 


Proof Define 
y(A,t) = exp, (t(v + Aw)), X(A,t) := dy, t) 


for all A and t. Since y(A, 0) = p for all A we have X(A, 0) = 0 and 


d 
V,X(A,0) = V;d,y(A, 0) = Vad: (A, 0) = a +dAw) = vw. 


Moreover, V;0;y = 0 and hence 


ViV:X = V;V: 0a 
= V,V,01y — VaVi0rY 
= Ry, 0,7)0:Y 
= R(0;y, X)dry. 


This proves Lemma 6.1.18. oO 


Proof of Theorem 6.1.17 The proofs (i) = > (ii) => (iii) (1) (iv) are 
as before; the reader might note that when L(y) < r we also have L(y’) < r for 
any development so that there are plenty of developments with y : [0, 1] — U, and 
y’ : [0, 1] + U/. The proof that (iv) implies (i) is a little different since (iv) here is 
somewhat weaker than (iv) of the global theorem: the equation ®, R = R’ is only 
assumed for certain developments. 

Hence assume (iv) and define @ : U, > U/ by 


o:= exp, 0 Py o exp, :U, > UY. 


We must prove that ¢@ is an isometry. Thus we fix a point g € U, and a tangent vector 
u € T,M and choose v, w € T,M with |v| <r such that 


exp,,(v) =4,  d exp,,(v)w =u. (6.1.13) 


Define y : [0,1] > U,, vy’: [0,1] > UJ, X © Vect(y), and X’ € Vect(y’) by 


y(t) = exp,, (tv), X(t) = exp, (f(v + Aw)), 
0 


) 
dA |,= 


) 
y(th= exp, (tov), X(t) := a 


| eXPi,, (t(fov + ADow)). 
Ate 8 
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Then, by definition of ¢, we have 
y':= oy, do(y)X = X'. (6.1.14) 
By Lemma 6.1.18, X is a solution of (6.1.12) and X’ is a solution of 
ViVi:X' = Ry, X )ary’, X'A,0) =0, ViX'(A,0) = Dow. (6.1.15) 
Now define ®(t) : Ty)M > T,q)M' by 
P(t) := P(t, 0) Hy, (0, t). 


Then @ intertwines covariant differentiation. Since y and y’ are parallel vector 
fields with p’(0) = ov = H(0)7(0), we have 


P(t)y(t) = y'(0) 


for every t. Moreover, it follows from (iv) that ®,R, = Ry). Combining this 
with (6.1.12) we obtain 


ViVi(@X) = OV, V,X = R'(Py, OX) OY = R'(y', ®X)y’. 


Hence the vector field X along y’ also satisfies the initial value problem (6.1.15) 
and thus 


@X = X' = dd(y)X. 
Here we have also used (6.1.14). Using (6.1.13) we find 
y(1) = exp, (v) = 4, X(1) = d exp, (v)w = u, 
and so 
do(q)ju = db(y(1))X() = XI) = P(1)u. 
Since ®(1) : T,q)M — T,q)M' is an orthogonal transformation this gives 


|dp(q)u| = |@C)ul = |ul. 


Hence ¢ is an isometry as claimed. This proves Theorem 6.1.17. A 


6.2 Flat Spaces 


Our aim in the next few sections is to give applications of the Cartan-Ambrose- 
Hicks Theorem. It is clear that the hypothesis ®,.R = R’ for all developments will 
be difficult to verify without drastic hypotheses on the curvature. The most drastic 
such hypothesis is that the curvature vanishes identically. 
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Definition 6.2.1 A Riemannian manifold / is called flat iff the Riemann curva- 
ture tensor R vanishes identically. 


Theorem 6.2.2 Let M C R” be a smooth m-manifold. 


G) M is flat if and only if every point has a neighborhood which is isometric to 
an open subset of R”, i.e. at each point p € M there exist local coordinates 
x!,...,x! such that the coordinate vectorfields E; = 0/0x' are orthonormal. 

(ii) Assume M is connected, simply connected, and complete. Then M is flat if and 


only if there is an isometry ¢ : M — R"” onto Euclidean space. 


Proof Assertion (i) follows immediately from Theorem 6.1.17 and (ii) follows 
immediately from Theorem 6.1.8. Oo 


Exercise 6.2.3. Carry over the Cartan—Ambrose-Hicks theorem and Theorem 6.2.2 
to the intrinsic setting. 


Exercise 6.2.4 A one-dimensional manifold is always flat. 
Exercise 6.2.5 If M, and M> are flat, sois M = M, x Mp. 


Example 6.2.6 By Exercises 6.2.4 and 6.2.5 the standard torus 
T” = {z = (Z],...,Zm) € C” | Izy, =--- = |Zm| = 1} 


is flat. 


Exercise 6.2.7. For a,b > 0 andc > 0 define M Cc C? by 


M := M(a,b,c) = {(u,v,w) € C? | lu] =a, |v] = b, w= cnet. 
a 


Then M is diffeomorphic to a torus (a product of two circles) and M is flat. If 
a',b’ > 0 and c’ > 0, prove that there is an isometry ¢@ from M’ = M(a’,b’,c’) 
to M = M(a,b,c) if and only if the triples (a’, b’, c’) and (a, b,c) are related by a 
permutation. 

Hint: Show first that an isometry ¢ : M’ — M that satisfies the condition 
o(a',b',c’) = (a,b,c) must have the form 


Psat gat w\* (uP wu’ fu'\? ul \&tY fy! \8r8 
d(u',v',w') = (+(*) (5) 4(=) (5) (5) (5) 


for integers a, B, y, 6 that satisfy #5 — By = +1. Show that this map ¢ is an isom- 
etry if and only if 


a? 4c? =o7a? + yb? + @t y)’c?, 
c” = aBa? + ybb? + (a+ y)(B + 8)c?, 


ge eae _ Prt rr + (B + 6)°c?. 
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Exercise 6.2.8 (Developable manifolds) Let n = m + 1 and let E(t) be a one- 
parameter family of hyperplanes in R”. Then there exists a smooth map u : R > R" 
such that 


E(t) = u(t)*, \u(t)| = 1, (6.2.1) 


for every t. We assume that u(t) 4 0 for every t so that u(t) and w(t) are linearly 
independent. Show that 


Li) := ut) na)t = lim E(t) M E(s). (6.2.2) 


Thus L(t) is a linear subspace of dimension m— 1. Now let y : R > R” be a smooth 
curve such that 


(y().u@)) =0, (y(t), u(t)) FO (6.2.3) 


for all t. This means that p(t) € E(t) and p(t) ¢ L(t); thus E(t) is spanned by L(t) 
and y(t). Fort € R and ¢ > 0 define 


L(t)e := {v € L(t) | |v] < e}. 


Let J C R be a bounded open interval such that the restriction of y to the closure 
of I is injective. Prove that, for ¢ > 0 sufficiently small, the set 


My := (vy +L.) 


tel 


is asmooth manifold of dimension m = n — 1. A manifold which arises this way is 
called developable. Show that the tangent spaces of Mp are the original subspaces 
E(t), ie. 


T,Mo = E(t) for PEeyt)+ Lt. 
(One therefore calls Mo the “envelope” of the hyperplanes y(t) + E(t).) Show 


that Mo is flat. (Hint: use Gau8—Codazzi.) If (®, y, y’) is a development of Mo 
along R”, show that the map ¢ : Mp > R”, defined by 


d(y(t) +) := yO) + Ov 
for v € L(t)¢, is an isometry onto an open set Mj C R’”. Thus a development “un- 


rolls” Mo onto the Euclidean space R”. When n = 3 and m = 2 one can visualize 
Mo as a twisted sheet of paper (see Fig. 6.3). 
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Remark 6.2.9 Given a codimension-! submanifold 
M Cc R™1! 


andacurve y : R — M we may form the osculating developable Mo to M along y 
by taking 


E(t) = Tya)M. 

This developable has common affine tangent spaces with M along y as 

Ty(y)Mo = E(t) = TyyM 
for every t. This gives a nice interpretation of parallel transport: Mo may be un- 
rolled onto a hyperplane where parallel transport has an obvious meaning and the 
identification of the tangent spaces thereby defines parallel transport in M. (See 
Remark 3.5.16.) 
Exercise 6.2.10 Each of the following is a developable surface in R?. 
(i) Acone ona plane curve I" Cc H, ie. 


M={tp+(U-dq|t>0,¢eT} 


where H C R? is an affine hyperplane, p € R*\H, and I’ C H isa 1-manifold. 
(ii) A cylinder on a plane curve I’, i.e. 


M=({qg+tulqdel,t eR} 
where Hf and J" are as in (i) and v is a fixed vector not parallel to H. (This is 
a cone with the cone point p at infinity.) 
(iii) The tangent developable to a space curve y : R > R?, ice. 
M = {y(t) + sy(t) | |t —to| < ¢, O< 8 <e}, 
where y(fo) and y (to) are linearly independent and ¢ > 0 is sufficiently small. 


(iv) The paper model of a Mobius strip (see Fig. 6.3). 


Fig. 6.3 Developable sur- 
faces 
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Fig. 6.4 A circular one- 
sheeted hyperboloid 


Remark 6.2.11 A 2-dimensional submanifold M C R? is called a ruled surface 
iff there is a straight line in M through every point. Every developable surface 
is ruled, however, there are ruled surfaces that are not developable. An example 
is the manifold M = {y(t) + s¥(t)| |t —to| < e, |s| < ¢} where y: R > R? isa 
smooth curve with |y| = 1 and j/(to) 4 0, and ¢ > 0 is sufficiently small; this sur- 
face is not developable in general. Other examples are the elliptic hyperboloid of 
one sheet 
x2 2 
M:= Jenzy eR? St 5=1] (6.2.4) 


depicted in Fig. 6.4, the hyperbolic paraboloid 


2 2 
ge 2? , (6.2.5) 


M:= Jor yz) ER 2 pe 


(both with two straight lines through every point in /), Pliicker’s conoid 


2 
Mim fey) eR Pe 202= 52}, (6.2.6) 
xe +y 
the helicoid 
=e eR | A a eel (6.2.7) 
Vx? + y? 
and the Mobius strip 
cos(s) cos(s/2) cos(s) 
M := sin(s) | + 5 cos(s/2) sin(s) E ee : (6.2.8) 
0 sin(s /2) 


These five surfaces have negative Gaufian curvature. The Mobius strip in (6.2.8) 
is not developable, while the paper model of the Mobius strip is. The helicoid 
in (6.2.7) is a minimal surface, i.e. its mean curvature (the trace of the second 
fundamental form) vanishes. A minimal surface which is not ruled is the catenoid 


M := {(x,y,z) € R?| x? + y? =c? cosh(z/c)}. 


(Exercise: Prove all this.) 
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6.3 Symmetric Spaces 


In the last section we applied the Cartan-Ambrose-Hicks Theorem in the flat case; 
the hypothesis ®,.R = R’ was easy to verify since both sides vanish. To find more 
general situations where we can verify this hypothesis note that for any development 
(®, y, y’) satisfying the initial conditions 


YO)= po, YO=pM, P(0)=%, 
we have 
P(t) = ©, (t, 0) PoP, (0, £) 
so that the hypothesis ®, R = R’ is certainly implied by the three hypotheses 


P(t, 0). Ryo = Ry 
D),(t, 0). Ry — Ry) 
_ pl 
(®o)«Rpy = Ry. 
The last hypothesis is a condition on the initial linear isomorphism 
®):T,,M — Ty M' 


while the former hypotheses are conditions on M and M’ respectively, namely, that 
the Riemann curvature tensor is invariant by parallel transport. It is rather amazing 
that this condition is equivalent to a simple geometric condition as we now show. 


6.3.1 Symmetric Spaces 


Definition 6.3.1 A Riemannian manifold M is called symmetric about the point 
p € M iff there exists a (necessarily unique) isometry 


@¢:M—>M 
satisfying 
$(P) = Pp, do(p) = —id; (6.3.1) 


M is called a symmetric space iff it is symmetric about each of its points. A 
Riemannian manifold M is called locally symmetric about the point p € M iff, 
for r > 0 sufficiently small, there exists an isometry 


@:U,(p,M) > U,(p,M), U,(p, M) := {¢ € M |d(p,q) <r}, 


satisfying (6.3.1); M is called a locally symmetric space iff it is locally symmetric 
about each of its points. 
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Remark 6.3.2. The proof of Theorem 6.3.4 below will show that, if M is 
locally symmetric, the isometry ¢ : U,(p,M) — U,(p,M) with $(p) = p 
and d@(p) = —id exists whenever 0 < r < inj(p). 


Exercise 6.3.3. Every symmetric space is complete. Hint: If y: ] > M is a 
geodesic and ¢ : M — M is a symmetry about the point y(fo) for f) € J, then 
o(y(to + t)) = y(to —¢t) for allt € R withtpn +t, —t el. 


Theorem 6.3.4 Let M CR" be anm-dimensional submanifold. Then the following 
are equivalent. 


G) M is locally symmetric. 
(ii) The covariant derivative V R (defined below) vanishes identically, i.e. 


(Vy R)p (v1, v2)w = 0 


forall p € M andv,v,,v2,w € T,M. 
(iii) The curvature tensor R is invariant under parallel transport, i.e. 


P(t, 5)xRys) = Ry) (6.3.2) 
for every smooth curve y :R — M andalls,t € R. 


Proof See Sect. 6.3.2. Oo 


Corollary 6.3.5 Let M and M' be locally symmetric spaces and fix two points Po € 
M and py € M’, and let ®y : T,,M — T,, M' be an orthogonal linear isomorphism. 
Let r > 0 be less than the injectivity radius of M at po and the injectivity radius of 
M' at po. Then the following holds. 


(i) There exists an isometry @ : U,(po, M) — U,(po, M’) with @(po) = pp and 
do(po) = ®po if and only if Bo intertwines R and R’, i.e. 


(®)-Rpy = Ry. (6.3.3) 


(ii) Assume M and M’ are connected, simply connected, and complete. Then there 
exists an isometry ¢: M — M' with (po) = po and do(po) = ®o if and only 
if Do satisfies (6.3.3). 


Proof In (i) and (ii) the “only if” statement follows from Theorem 5.3.1 (Theo- 
rema Egregium) with ®) := d#(po). To prove the “if” statement, let (®, y, y’) be 
a development satisfying y(0) = po, y’(0) = po, and ®(0) = Go. Since R and R’ 
are invariant under parallel transport, by Theorem 6.3.4, it follows from the discus- 
sion in the beginning of this section that ®,R = R’. Hence assertion (i) follows 
from the local C-A-H Theorem 6.1.17 and assertion (ii) follows from the global 
C-A-H Theorem 6.1.8. Oo 
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Corollary 6.3.6 A connected, simply connected, complete, locally symmetric 
space is symmetric. 


Proof Corollary 6.3.5 (ii) with M’ = M, po = po, and ®y) = —id. Oo 


Corollary 6.3.7 A connected symmetric space M is homogeneous; i.e. given 
P.qd € M there exists an isometry 6: M > M with 6(p) = q. 


Proof If M is simply connected, the assertion follows from part (11) of Corol- 
lary 6.3.5 with M = M’, po = p, py = 4, and Gy = (1,0): 7,M > TM, 
where y : [0,1] ~ M isacurve from p to q. If M is not simply connected, we can 
argue as follows. There is an equivalence relation on M defined by 


P~L iS disometryd: M > M 3 o(p) = 4. 


Let p,q € M and suppose that d(p, g) < inj(p). By Theorem 4.4.4 there is a unique 
shortest geodesic y : [0, 1] — M connecting p to g. Since M is symmetric there is 
an isometry @: M — M such that (y(1/2)) = y(1/2) and dg(y(1/2)) = —id. This 
isometry satisfies @(y(t)) = y(1 — t) and hence ¢(p) = q. Thus p ~ q whenever 
d(p,q) < inj(p). This shows that each equivalence class is open, hence each equiv- 
alence class is also closed, and hence there is only one equivalence class because 
M is connected. This proves Corollary 6.3.7. Oo 


6.3.2 Covariant Derivative of the Curvature 


For two vector spaces V,W and an integer k > 1 we denote by L*(V,W) 
the vector space of all multi-linear maps from V‘ = V x --- x V to W. Thus 
L'(V, W) = L(V, W) is the space of all linear maps from V to W. 


Definition 6.3.8 The covariant derivative of the Riemann curvature tensor as- 
signs to every p € M a linear map 


(VR)p 2 TyM > L£?(T,M, L(T)M, T,M)) 
such that 


(VR)(X)(X1, X2)¥ = Vx (R(X, X2)Y) — R(Vx X1, X2)¥ (63.4) 
— R(X, Vy Xo)¥ — R(X, X2)VxY 
for all X, X;, X2, Y € Vect(M). We also use the notation 
(Vy R)p = (VR)p(v) 
for p € M and v € T,M so that 
(Vx R)(X1, X2)¥ = (VR)(X)(X1, X2)¥ 


for all X¥, X;, X2, Y € Vect(M). 
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Remark 6.3.9 One verifies easily that the map 
Vect(M)* — Vect(M) : (X, X1, X2, Y) (Vx R)(X1, X2)Y, 


defined by the right hand side of equation (6.3.4), is multi-linear over the ring of 
functions ¥(M). Hence it follows as in Remark 5.2.13 that VR is well defined, i.e. 
that the right hand side of (6.3.4) at p € M depends only on the tangent vectors 


X(p), Xi(p), X2(p), Y(p). 
Remark 6.3.10 Let y : J — M beasmooth curve on an interval J C R and 
X,,X2,Y € Vect(y) 


be smooth vector fields along y. Then equation (6.3.4) continues to hold with X re- 
placed by y and each Vy on the right hand side replaced by the covariant derivative 
of the respective vector field along y: 


(V,R)(X1, X2)¥ = V(R(X1, X2)¥) — R(VX1, X2)Y 


(6.3.5) 
= R(X), VXO)Y = RY XV. 


Theorem 6.3.11 


G) Ify:R— M isa smooth curve such that y(0) = p and y(O) = v, then 


d 
(VoR)p = = 2 0.1)«Ryo (6.3.6) 
t= 


(ii) The covariant derivative of the Riemann curvature tensor satisfies the second 
Bianchi identity 


(Vy R)(Y, Z) + (Vy R)(Z, X) + (Vz R)(X, Y) = 0. (6.3.7) 
Proof We prove (i). Let vj,v2,w € T,M and choose parallel vector fields 


X\,X2, Y € Vect(y) along y satisfying the initial conditions X| (0) = v1, X2(0) = vo, 
Y(0) = w. Thus 


X\(t) = P(t, 0)u1, X2(t) = O,(t,0)v2, Y(t) = @ (¢,0)w. 
Then the last three terms on the right vanish in equation (6.3.5) and hence 


(V,R)(v1, v2)w = V(R(X,, X2) ¥)(0) 
Py, Ry (Xi), X20) VO) 
t=0 


a 
dt 
d 
di 0,1) Ry ay) (Py (¢, 0) v1, ®, (¢, 0) v2) B (t, 0) w 
d 


— 7d (2, (0, t), Ry) (V1 ’ U2) w. 
t\+=0 


Here the second equation follows from Theorem 3.3.6. This proves (i). 


270 6 Geometry and Topology 


We prove (ii). Choose a smooth function y : R* — M and denote by (r,s, t) the 
coordinates on R?. If Y is a vector field along y, we have 


(Va,yR)(Osy, O:Y)Y = V-(ROsy. Ory)Y) — ROsy. Iry)V-¥ 
— R(V,dsy, I:y)Y — ROsy, VrOry)Y 
= V,(V,V;:Y — V;V,Y) — (VV; — VV, V,¥ 
+ R(:y, V-dsv)Y¥ — Rsv, Vid-y)Y. 


Permuting the variables r,s,¢ cyclically and taking the sum of the resulting three 
equations we obtain 


(Va,yR)Osy, O1Y)Y + (Vay R)(OrY, Or Y)Y + (Vay ROY, Osv)Y 
= V.(VsVi¥ —ViV.¥) = (VeVi = VV) Ve¥ 

+ VAV MeY = V9) = OL = VV 

4+ VAV,V.Y —V.V,-¥) = (VV, = VV iY. 


The terms on the right cancel out. This proves Theorem 6.3.11. O 


Proof of Theorem 6.3.4 We prove that (iii) implies (1). This follows from the local 
Cartan—Ambrose-Hicks Theorem 6.1.17 with 


Py = Po=— DP, @) = —id: T,M > T,M. 

This isomorphism satisfies 
(Po). Rp = Ry. 
Hence it follows from the discussion in the beginning of this section that 
®,.R = R’ 
for every development (@, y, y’) of M along itself satisfying 
yv0)=y'(0)=p, (0) = —id. 
Hence, by the local C-A-H Theorem 6.1.17, there is an isometry 
o: U,(p,M) > U,(p,M) 
satisfying 
b(p)=p,  do(p) =—id 


whenever 0 < r < inj(p; M). 
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We prove that (i) implies (ii). By Theorem 5.3.1 (Theorema Egregium), every 
isometry ¢ : M — M’ preserves the Riemann curvature tensor and covariant differ- 
entiation, and hence also the covariant derivative of the Riemann curvature tensor, 
ie. 


ox (VR) = VR’. 
Applying this to the local isometry ¢ : U,(p, M) — U,(p, M) we obtain 
(Var R) gp) (4b(P)r1. db (p)v2) = db(p)(Vo R)(v1, v2)db(p) 
for all v, vj, v2 € TM. Since 
d$(p) = —id 
this shows that V R vanishes at p. 
We prove that (ii) imlies (iii). If VR vanishes, then equation (6.3.6) in Theo- 
rem 6.3.11 shows that the function 


St> ®,(t,5)4Rys) = P(t, 0). P, (0, 5). Ry) 


is constant and hence is everywhere equal to Ry). This implies (6.3.2) and com- 
pletes the proof of Theorem 6.3.4. O 


6.3.3 Covariant Derivative of the Curvature in Local Coordinates 


Let 6: U > 2 bea local coordinate chart on M with values in an open set 2 C R”, 
denote its inverse by y := @-! : 2 > U, and let 


3 
E(x) = (a) € Tro, xEQ, i=l,...,m, 


be the local frame of the tangent bundle determined by this coordinate chart. Let 
ck : 82 — R denote the Christoffel symbols and RE , 1 2 — R the coefficients of 
the Riemann curvature tensor so that 


ViE = OTEK, RUE, E)) Ex = Y> Rin Ee. 
k £ 


Given i, j,k, € {1,...,m} we can express the vector field (Vz, R)(E;, Ex) E¢ 
along y for each x € 92 as a linear combination of the basis vectors E; (x). This 
gives rise to functions 


Vi Rigi 2>R 
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defined by 


(Ve, R)(E;, Ex) Ee =: > Vv; Rip Ey. (6.3.8) 


These functions are given by 


ViRjce = Rie + Tp Rie 
be 


P , : (6.3.9) 
= uke ~ > Tie Rive — > Me fey 
be be u 
The second Bianchi identity has the form 
Vi Rice + Vi Rie + ViRiie = 0. (6.3.10) 


Exercise: Prove equations (6.3.9) and (6.3.10). Warning: As in Sect. 5.4, care must 
be taken with the ordering of the indices. Some authors use the notation V; Rex for 
what we call V; Rice: 


6.3.4 Examples and Exercises 
Example 6.3.12 Every flat manifold is locally symmetric. 
Example 6.3.13 If M, and M; are (locally) symmetric, so is M, x Mp. 


Example 6.3.14 M = R” with the standard metric is a symmetric space. Recall 
that the isometry group 7(IR”) consists of all affine transformations of the form 


g(x) = Ax +b, AéeO(m), be R”. 


(See Exercise 5.1.4.) The isometry with fixed point p € R” and d¢(p) = —id is 
given by #(x) = 2p —x for x € R”. 


Example 6.3.15 The flat tori of Exercise 6.2.7 in the previous section are symmet- 
ric (but not simply connected). This shows that the hypothesis of simple connectiv- 
ity cannot be dropped in part (ii) of Corollary 6.3.5. 


Example 6.3.16 Below we define manifolds of constant curvature and show that 
they are locally symmetric. The simplest example, after a flat space, is the unit 
sphere S$” = {x ER™*!| |x| = 1}. The symmetry ¢ of the sphere about a point 
Pp € M is given by 


P(x) = —x + 2(p,x)p 
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for x € S’”. This extends to an orthogonal linear transformation of the ambient 
space. In fact the group of isometries of S”” is the group O(m + 1) of orthogonal 
linear transformations of R”*! (see Example 6.4.16 below). In accordance with 
Corollary 6.3.7 this group acts transitively on S”’. 


Example 6.3.17 A compact two-dimensional manifold of constant negative curva- 
ture is locally symmetric (as its universal cover is symmetric) but not homogeneous 
(as closed geodesics of a given period are isolated). Hence it is not symmetric. This 
shows that the hypothesis that M be simply connected cannot be dropped in Corol- 
lary 6.3.6. 


Example 6.3.18 The real projective space RP” with the metric inherited from S” 
is a symmetric space and the orthogonal group O(n + 1) acts on it by isometries. 
The complex projective space CP” with the Fubini-Study metric in Example 3.7.5 
is a symmetric space and the unitary group U(n + 1) acts on it by isometries. The 
complex Grassmannian G;(C”) in Example 3.7.6 is a symmetric space and the 
unitary group U() acts on it by isometries. (Exercise: Prove this.) 


Example 6.3.19 The simplest example of a symmetric space which is not of 
constant curvature is the orthogonal group O(7) = { geR"™|g'g= 1} with the 
Riemannian metric (5.2.20) of Example 5.2.18. The symmetry ¢ about the point 
a € O(n) is given by ¢(g) = ag~!a. This discussion extends to every Lie subgroup 
G C O(n). (Exercise: Prove this.) 


6.4 Constant Curvature 


In the Sect. 5.3 we saw that the GauBian curvature of a two-dimensional surface is 
intrinsic: we gave a formula for it in terms of the Riemann curvature tensor and the 
first fundamental form. We may use this formula to define the GauBian curvature 
for any two-dimensional manifold (even if its codimension is greater than one). We 
make a slightly more general definition. 


6.4.1 Sectional Curvature 


Definition 6.4.1 Let M C R” be asmooth m-dimensional submanifold. Let p € M 
and let E C T,M be a 2-dimensional linear subspace of the tangent space. The 
sectional curvature of M at (p, E) is the number 


(Rp(u, v)v, u) 


Fd Ce a ee eae 
(PE) = TF alye— (uo) 


(6.4.1) 


where u,v € E are linearly independent (and hence form a basis of £). 
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The right hand side of (6.4.1) remains unchanged if we multiply u or v by a 
nonzero real number or add to one of the vectors a real multiple of the other; hence 
it depends only on the linear subspace spanned by wu ad v. 


Example 6.4.2. If M C R? is a 2-manifold, then by Theorem 5.3.9 the sectional 
curvature K(p, 7,M) = K(p) is the GauBian curvature of M at p. More generally, 
for any 2-manifold M Cc R” (whether or not it has codimension one) we define the 
GauBian curvature of M at p by 


K(p) := K(p,T,M). (6.4.2) 
Example 6.4.3 If M CR”*! is a submanifold of codimension one and v: M > S” 


is a Gau8 map, then the sectional curvature of a 2-dimensional subspace E C T,M 
spanned by two linearly independent tangent vectors u, v € T,M is given by 


(u,dv(p)u)(v, dv(pyv) = (u,dv(p)v)? 


K(p, E) = 
(p£) luPloP — (uw, vy 


(6.4.3) 


This follows from equation (5.3.8) in the proof of Theorem 5.3.9 which holds 
in all dimensions. In particular, when M = S’, we have v(p) = p and hence 
K(p, E) = 1 for all p and E. For a sphere of radius r we have v(p) = p/r and 
hence K(p, E) = 1/r?. 


Example 6.4.4 Let G C O(7) be a Lie subgroup equipped with the Riemannian 
metric 


(v, w) := trace(v'w) 


for v,w € TgG C R"*". Then, by Example 5.2.18, the sectional curvature of G at 
the identity matrix 1 is given by 


K(,£) = sIle.P 


for every 2-dimensional linear subspace E C g = Lie(G) = 71G with an orthonor- 
mal basis &, 7. 


Exercise 6.4.5 Let E C 7T,M be a 2-dimensional linear subspace, let r > 0 be 
smaller than the injectivity radius of M at p, and let N C M be the 2-dimensional 
submanifold given by 


N := exp,({v € E||v| <r}). 


Show that the sectional curvature K(p, E) of M at (p, E) agrees with the GauBian 
curvature of N at p. 
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Exercise 6.4.6 Let p ¢ M C R” and let E C T,M be a 2-dimensional linear 


subspace. For r > 0 let L denote the ball of radius r in the (7 — m + 2)-dimensional 
affine subspace of IR” through p and parallel to the vector subspace E + 7, M a 


= {ptu+wlvek, weéeT,M-, jul? + wl? < 7°. 
Show that, for r sufficiently small, LN M is a 2-dimensional manifold with GauBian 
curvature Kj (p) at p given by Krnm(p) = K(p, £). 
6.4.2. Constant Sectional Curvature 
Definition 6.4.7 Let k ¢ R and m > 2 be an integer. An m-manifold M C R" is 
said to have constant sectional curvature « iff K(p, £) =k for every p € M and 


every 2-dimensional linear subspace E C 7,M. 


Theorem 6.4.8 Let M C R” be an m-manifold and fix an element p € M anda 
real number k. Then the following are equivalent. 


(i) K(p, E) =k for every 2-dimensional linear subspace E C T,M. 
(ii) The Riemann curvature tensor of M at p is given by 


(Rp(v4, v2)v3, V4) = k((v1, v4) (V2, V3) — (v1, 03) (v2, v4)) (6.4.4) 
for all vj, v2, 03,04 € T,M. 
Proof That (ii) implies (i) follows directly from the definition of the sectional 


curvature in (6.4.1) by taking v) = v4 = u and v2 = v3 = v in (6.4.4). Conversely, 
assume (i) and define the multi-linear map Q : T, M* > R by 


O(v1, V2, 03, U4) = (Rp(U1, v2)v3, V4) — k((v1, U4) (V2, V3) — (V1, U3) (V2, v4)). 


Then, for all u,v, v1, v2, v3, v4 € T,M, the map Q satisfies the equations 


QO(v}, V2, U3, V4) + O(V2, V1, V3, V4) = O, (6.4.5) 

O(v1, V2, V3, V4) + O(v2, V3, V1, 04) + O(V3, U1, V2, V4) = 0, (6.4.6) 
O(v1, v2, V3, V4) — O(U3, V4, U1, V2) = O, (6.4.7) 

O(u,v,u,v) = 0. (6.4.8) 


Here the first three equations follow from Theorem 5.2.14 and the last follows from 
the definition of Q and the hypothesis that the sectional curvature is K(p, E) = k 
for every 2-dimensional linear subspace E C 7,M. 
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We must prove that Q vanishes. Using (6.4.7) and (6.4.8) we find 


0 = Olu, v1 + v2,U, vy + V2) 
= Olu, V1,U, v2) + O(u, v2, U, V1) 
= 20(u, 1, U, v2) 


for all u, vj, v2 € TM. This implies 


0 = Q(uy + U2, v1, Uy + Up, V2) 


= O(uy, v1, Uz, v2) + O(u2, v1, U1, v2) 


for all uw}, v2, V1, V2 € T,M. Hence 


O(v}, V2, V3, U4) = —O(U3, V2, V1, U4) 
= Q(v2, v3, V1, V4) 
= —Q(v3, V1, V2, v4) — O(U1, V2, V3, V4). 


Here the second equation follows from (6.4.5) and the last from (6.4.6). Thus 


1 1 
O(v1, V2, V3, U4) = 72s, U1, V2» v4) = 5 2M1, 03, 02, V4) 


for all vj, v2, 03, v4 € T,M and, repeating this argument, 


1 
O(v1, V2, V3, V4) = ZE1 V2, U3, v4). 


Hence Q = 0 as claimed. This proves Theorem 6.4.8. Oo 


Remark 6.4.9 The symmetric group S4 on four symbols acts naturally on the 
space £*(7; M,R) of multi-linear maps from 7; M* to R. The conditions (6.4.5), 
(6.4.6), (6.4.7), and (6.4.8) say that the four elements 


a = id+ (12), 
c = id + (123) + (132), 
b = id— (34), 


d = id + (13) + (24) + (13)(24) 


of the group ring of S4 annihilate Q. This suggests an alternate proof of Theo- 
rem 6.4.8. A representation of a finite group is completely reducible so one can 
prove that Q = 0 by showing that any vector in any irreducible representation of 
S4 which is annihilated by the four elements a, b,c and d must necessarily be zero. 
This can be checked case by case for each irreducible representation. (The group S4 
has 5 irreducible representations: two of dimension 1, two of dimension 3, and one 
of dimension 2.) 
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If M and M’ are two m-dimensional manifolds with constant curvature k, then 
every orthogonal isomorphism ® : T, M — T,, M' intertwines the Riemann curva- 
ture tensors by Theorem 6.4.8. Hence by the appropriate version (local or global) 
of the C-A-H Theorem we have the following corollaries. 


Corollary 6.4.10 Every Riemannian manifold with constant sectional curvature 
is locally symmetric. 


Proof Theorem 6.3.4 and Theorem 6.4.8. Oo 


Corollary 6.4.11 Let M and M' be m-dimensional Riemannian manifolds with 
constant curvature k and let p € M and p’ € M’. If r > 0 is smaller than the 
injectivity radii of M at p and of M' at p’, then for every orthogonal isomorphism 


®:T,M > T,M' 
there exists an isometry 
b:U,(p,M) > U,(p', M') 
such that 


o(p) = py’. dp(p) = ®. 


Proof This follows from Corollary 6.3.5 and Corollary 6.4.10. Alternatively one 
can use Theorem 6.4.8 and the local C-A-H Theorem 6.1.17. Oo 


Corollary 6.4.12 Any two connected, simply connected, complete Riemannian 
manifolds with the same constant sectional curvature and the same dimension are 
isometric. 


Proof Theorem 6.4.8 and the global C-A-H Theorem 6.1.8. oO 


Corollary 6.4.13 Let M C R"” be a connected, simply connected, complete mani- 
fold. Then the following are equivalent. 


(i) M has constant sectional curvature. 
(ii) For every pair of points p,q € M and every orthogonal linear isomorphism 
®:T,M — T,M there exists an isometry ¢ : M — M such that 


d(P) = 4, d¢(p) = ®. 


Proof That (i) implies (i) follows immediately from Theorem 6.4.8 and the 
global C-A-H Theorem 6.1.8. Conversely assume (ii). Then, for every pair of 
points p,q € M and every orthogonal linear isomorphism 


®:T,M > T,M, 
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it follows from Theorem 5.3.1 (Theorema Egregium) that 
®, Ry = Ry 
and so 
K(p, E) = K(q, PE) 


for every 2-dimensional linear subspace E C 7,M. Since, for every pair of 
points p,q € M and of 2-dimensional linear subspaces 


ECT,M, FCT,M, 
we can find an orthogonal linear isomorphism © : T,M — T,M such that 


GE = F, 


this implies (i). Oo 


Corollary 6.4.13 asserts that a connected, simply connected, complete Rieman- 
nian m-manifold M has constant sectional curvature if and only if the isometry 
group J(M) acts transitively on its orthonormal frame bundle O(M). Note that, by 
Lemma 5.1.10, this group action is also free. 


6.4.3. Examples and Exercises 


Example 6.4.14 Any flat Riemannian manifold has constant sectional curvature 
k =0. 


Example 6.4.15 The manifold 
M =R"” 


with its standard metric is, up to isometry, the unique connected, simply connected, 
complete Riemannian m-manifold with constant sectional curvature 


k=0. 


Example 6.4.16 Form > 2 the unit sphere 
M =S" 


with its standard metric is, up to isometry, the unique connected, simply connected, 
complete Riemannian m-manifold with constant sectional curvature 


k=1. 
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Hence, by Corollary 6.4.12, every connected simply connected, complete Rieman- 
nian manifold with positive sectional curvature k = 1 is compact. Moreover, by 
Corollary 6.4.13, the isometry group 7(S”) is isomorphic to the group O(m + 1) 
of orthogonal linear transformations of R’*!. Thus, by Corollary 6.4.13, the or- 
thonormal frame bundle O(S”) is diffeomorphic to O(m + 1). This follows also 
from the fact that, if 


Ui,+-+,Um 
is an orthonormal basis of 7,5” = p+ then 
DP. V1,-++5Um 
is an orthonormal basis of R”t?. 


Example 6.4.17 A product of spheres is not a space of constant sectional curva- 
ture, but it is a symmetric space. Exercise: Prove this. 


Example 6.4.18 For n > 4 the orthogonal group O(7) is not a space of constant 
sectional curvature, but it is a symmetric space and has nonnegative sectional cur- 
vature (see Example 6.4.4). 


6.4.4 Hyperbolic Space 


Fix an integer m > 2. The hyperbolic space Hi” is, up to isometry, the unique 
connected, simply connected, complete Riemannian m-manifold with constant sec- 
tional curvature 


k=-1. 
A model for HI” can be constructed as follows. A point in R’”’*! will be denoted by 
DP = (x0, x), xo ER, X= Cian, XH) ER": 
Let QO : R”*! x R”+t! — R denote the symmetric bilinear form given by 
O(D. 4) = —XoYo + X11 +o + XY (6.4.9) 
for p = (xo, x),g = (yo. y) € R”™!. Since O is nondegenerate the space 
H” := {p = (xo,x) € R”*"| O(p, p) = —1, x0 > 0} 


is a smooth m-dimensional submanifold of R’’*! and the tangent space of H” at p 
is given by 


T,H”™ = {v € R@t! | O(p, v) = O}. 
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For p = (xo, x) € R™*! and v = (,&) € R™*! we have 


peEH” = xw=V14([x/?, 
(§, x) 
V1 + [xP 


Now let us define a Riemannian metric on HI” by 


véT,H” <> & = 


8p(v,w) = O(v, w) 


= (&,) — Gono (6.4.10) 
ip ys 
, tele 


for v = (&, €) € T,H” and w = (no, 9) € T,H”. 


Theorem 6.4.19 HI” is a connected, simply connected, complete Riemannian m- 
manifold with constant sectional curvature k = —1. 


We remark that the manifold H” does not quite fit into the extrinsic framework 
of most of this book as it is not exhibited as a submanifold of Euclidean space but 
rather of “pseudo-Euclidean space”: the positive definite inner product (v, w) of 
the ambient space R’t! is replaced by a nondegenerate symmetric bilinear form 
O(v, w). However, all the theory developed thus far goes through (reading Q(v, w) 
for (v, w)) provided we impose the additional hypothesis (true in the example M = 
IH!" that the first fundamental form g, = Q|7, w is positive definite. For then Q|r,.u 
is nondegenerate and we may define the orthogonal projection [7(p) onto 7, M as 
before. The next lemma summarizes the basic observations; the proof is an exercise 
in linear algebra. 


Lemma 6.4.20 Let QO be a symmetric bilinear form on a vector space V and for 
each subspace E of V define its orthogonal complement by 


E+? := {u € V| O(u,v) = 0 Wu € E}. 


Assume Q is nondegenerate, i.e. V+ = {0}. Then, for every linear subspace E CV, 
we have 


V=E@Et? = EN Et? = {0}, 


i.e. E+2 is a vector space complement of E if and only if the restriction of Q to E 
is nondegenerate. 
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Proof of Theorem 6.4.19 The proofs of the various properties of H’” are entirely 
analogous to the corresponding proofs for S$”. Thus the unit normal field to HI” is 
given by 


v(p) = Pp 
for p € H” although the “square of its length” is 


O(p, p) = -1. 


For p € H” we introduce the Q-orthogonal projection I7(p) of R’*! onto the 
subspace 7, HI”. It is characterized by the conditions 


II(p)’ =I1(p), —kerf1(p) Lo im/7(p), —_—imI(p) = T,H”, 
and is given by the explicit formula 


IT(p)v =v + Ov, p)p 


forv € R™*!, 
The covariant derivative of a vector field X € Vect(y) along a smooth curve 
y :R— H" is given by 


VX(t) = M(y(t))X() 
= X() + XO. VOW 
= X01) - OXO. yO). 
The last identity follows by differentiating the equation O(X, y) = 0. This can be 
interpreted as the hyperbolic Gauf{—Weingarten formula as follows. For p € HI” and 
u € T,H” we introduce, as before, the second fundamental form 
hp(u) : TpH™ > (T,H™)+2 
via 
hy(u)v := (dIT(p)u)v 
and denote its Q-adjoint by 
hp(u)* : (IpH")"2 > T>H™. 


For all p ¢ H”,u ¢ T,H”, and v € R”+! we have 


(ammo =F) (0+ O07 + rY(p + 1) 
t t=0 


Qu, p)u + Ov, u)p, 


282 6 Geometry and Topology 


where the first summand on the right is tangent to H!” and the second summand is 
Q-orthogonal to 7, HI”. Hence 


hp(u)v = O(v,u)p, h,(u)*w = Q(w, p)u (6.4.11) 


for v € T,H” and w ¢€ (7,H")"2, 
With this understood, the GauB-Weingarten formula 


X=VX +h,(y)X 
extends to the present setting. The reader may verify that the operators 
V : Vect(y) — Vect(y) 
thus defined satisfy the axioms of Theorem 3.7.8 and hence define the Levi-Civita 
connection on H’””. 
Now a smooth curve y : J — Hi’ is a geodesic if and only if it satisfies the 
equivalent conditions 


Vy=0 <= jyp@)loT,wH” Vtrel = y=OQ(y.yY)y. 


A geodesic must satisfy the equation 


d ae rae 
Beow) =20(y,y) =0 


because y is a scalar multiple of y, and hence Q(y,y) is constant. Fix an ele- 
ment p € H” and a tangent vector v € 7,/H” such that 


O(v,v) = 1. 
Then the geodesic y : R > HI” with y(0) = p and y(0) = v is given by 
y(t) = cosh(t) p + sinh(t)v, (6.4.12) 
where 
ete ei —en 
h(t) := ———_, inh(t) := 
cosh(t) 5 sinh(t) 5 


In fact we have y(t) = y(t) Lo T,)H”. It follows that the geodesics exist for 
all time and hence H” is geodesically complete. Moreover, being diffeomorphic to 
Euclidean space, HI” is connected and simply connected. 

It remains to prove that HH” has constant sectional curvature k = —1. To see this 
we use the Gaufi—Codazzi formula in the hyperbolic setting, i.e. 


R,(u, v) = hy(u)*h,(v) — hyp(v)*hp(u). (6.4.13) 
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By equation (6.4.11), this gives 


(Rp(u, v)v,u) = O(hp(uju, hp(v)v) — O(Ap(v)u, hp(u)v) 
= Q(Q(u,u)p, O(v, v)p) — Q(Q(u, v)p, Olu, v)p) 
= —O(u,u)Q(v,v) + Ou, v) 
= —8p(u,u)gp(v,v) + gp(u,v) 


for all u,v € T, HI”. Hence, for every p € M and every 2-dimensional linear sub- 
space E C T,M with a basis u,v € E we have 


(Rp(u, v)v, u) _ 
Sp(u,U)gp(v,v) — Bp(u,v)? 


K(p, E) = 


This proves Theorem 6.4.19. O 


Exercise 6.4.21 Prove that the pullback of the metric on H!” under the diffeomor- 
phism 


R” > H":xph (v1 + [xP, x) 
is given by 


ee — 8? 
1+ |x/ 


[x], = 


for x € R” and Xx € R” = T,R”. Thus the metric tensor is given by 


XjXj 


Bij (x) = bij 
for x = (X1,...,Xm) € R”. 


Exercise 6.4.22 The Poincaré model of hyperbolic space is the open unit disc 
D” C R” equipped with the Poincaré metric 


> 1=|y? 


rl 2\9| 


for y € D” and y € R” = 7,D”. Thus the metric tensor is given by 


(1-1?) 


yeD". (6.4.15) 


Sij (y) = 
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Prove that the diffeomorphism 


1 = 3 
De att yo ( + Il -) 
L=|yl° L=|y] 


is an isometry with the inverse 


HH” > D” : (x%,x) 


14+ x9 
Interpret this map as a stereographic projection from the south pole (—1, 0). 


Exercise 6.4.23 The composition of the isometries in Exercise 6.4.21 and Exer- 
cise 6.4.22 is the diffeomorphism R” —> D” : x b& y given by 


x 2y 1+ |y 
ee ee ery es 
V1+|x/2+1 1—|y/? 1—ly/? 


Prove that this is an isometry intertwining the Riemannian metrics (6.4.14) 
and (6.4.15). 


| 2 


Exercise 6.4.24 This exercise shows that every nonconstant geodesic in the 
Poincaré model DD” of hyperbolic space in Exercise 6.4.22 converges to two points 
on the boundary S”"~! = dD” in forward and backward time, and that any two dis- 
tinct points on the boundary are the asymptotic limits of a unique geodesic in D” 
up to reparametrization. 

Fix an element y € D” and a tangent vector y € 7,D” = R” of norm one in 
the hyperbolic metric, i.e. 


2 


Alyl=1,  Arv=——. 
b= |x? 


(6.4.16) 


Let y : R > D” be the unique geodesic satisfying y(0) = y and y(0) = ¥. Prove 
the following. 


(a) The geodesic y is given by the explicit formula 


cosh(t)Ay + sinh(t)(AY + (Ay, AP) y) 


WO Fa: cosh(t)(A — 1) + sinh(r)(Ay, A3) cae 
for t € R. Hint: Use (6.4.12) and the isometries in Exercise 6.4.22. 
(b) The limits 
ye = lim y(t) es" 
exist and are given by 
_ Ay +Ay + Ay Ady AY AP ANY, ea ag) 


oS FeteOpaey 2 Stag Ag) 
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(c) Assume y ¢ Ry. Then there is a unique circle in R” through y_ and y, that is 
orthogonal to S”~! at y. The center c € R” and the radius r of this circle are 
given by 


oye tye (WA (Ay, AVY?) y — Ay, AP)AP 
a a 2_ = N90 
1+ (y+, ¥-) A* = 21 — (Ay, AY) (6.4.19) 
4. 1 =e yo 1 


r 


~ T+ (yyy) A220 — (Ay, AD)?’ 


(d) Let c and r be as in (c). Then the geodesic y in (a) satisfies |y(t) — c| = r for 
all t. Hint: It suffices to verify this equation for ¢ = 0. 

(e) If y € Ry, then y_ + y, = 0 and the geodesic y traverses a segment of a 
straight line through the origin. 

(f) Fix two distinct points y_ and y, on the unit sphere S’’~!. Then there ex- 
ists a geodesic y : R > D” such that lim,_.4., y(t) = y+. If vy’: R > D” is 
any other geodesic satisfying lim;.+.5 y’(t) = y+, then there exist real num- 
bers a, b such that a > 0 and y’(t) = y(at +b) forallt € R. 


m 


Exercise 6.4.25 Prove that the isometry group of HI!” is the pseudo-orthogonal 


group 


71(H’) = O(m, 1) := }g € GL(m + 1) 


O(gv, gw) = at 


for all v,w € R”*! 


Thus, by Corollary 6.4.13, the orthonormal frame bundle O(/HI”) is diffeomorphic 
to O(m, 1). 


Exercise 6.4.26 Prove that the exponential map exp, : 7,H” — H" is given by 


sinh( /O(, v)) 
Yow») 


for v € 7,H” = p+. Prove that this map is a diffeomorphism for every p € Hi”. 
Thus any two points in HI!” are connected by a unique geodesic. Prove that the 
intrinsic distance function on hyperbolic space is given by 


d(p.q) = cosh '(Q(p,q)) (6.4.21) 


for p,q € H’”. Compare this with Example 4.3.11. 


(6.4.20) 


exp,(v) = cosh( O(v, v”))p + 


Exercise 6.4.27 In the case m = 2 the Poincaré model of hyperbolic space in Ex- 
ercise 6.4.22 is the open unit disc D C C in the complex plane. It can be identified 
with the upper half plane 


H = {z € C | Im(z) > 0} 
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via the diffeomorphism 


1i—Z 


i+z 


H-D:zpr 


Show that the pullback of the Poincaré metric on D under this diffeomorphism is 
the Riemannian metric on H given by 


forz = x +iy € HandZ e€ T,H = C. Show that the isometries of H (in the iden- 
tity component) have the form 


(jae é q 


; € SL(2,R), 

cz+d d oo 

and deduce that the Lie group PSL(2, R) := SL(2, R)/{+1} is isomorphic to the 
identity component of O(2, 1). Prove that every nonconstant geodesic in H traverses 
either a vertical half line or a semicircle centered at a point on the boundary DH = R. 


6.5 Nonpositive Sectional Curvature 


In the previous section we have seen that any two points in a connected, simply con- 
nected, complete manifold M of constant negative curvature are joined by a unique 
geodesic (Exercise 6.4.26). Thus the entire manifold M is geodesically convex and 
its injectivity radius is infinity. This continues to hold in much greater generality 
for manifolds with nonpositive sectional curvature. It is convenient, at this point, to 
extend the discussion to Riemannian manifolds in the intrinsic setting. In particular, 
at some point in the proof of the main theorem of this section and in our main ex- 
ample, we shall work with a Riemannian metric that does not arise (in any obvious 
way) from an embedding. 


Definition 6.5.1 A Riemannian manifold M is said to have nonpositive sec- 
tional curvature iff K(p, £) < 0 for every p € M and every 2-dimensional lin- 
ear subspace E C T,M or, equivalently, (R,(u,v)v,u) < 0 for all p € M and 
all u,v € T,M. A nonempty, connected, simply connected, complete Riemannan 
manifold with nonpositive sectional curvature is called a Hadamard manifold. 


6.5.1 The Cartan—Hadamard Theorem 


The next theorem shows that every Hadamard manifold is diffeomorphic to Eu- 
clidean space and has infinite injectivity radius. This is in sharp contrast to positive 
curvature manifolds as the example M = S” shows. 
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Theorem 6.5.2 (Cartan—Hadamard) Let M be a connected, simply connected, 
complete Riemannan manifold. Then the following are equivalent. 


(i) M has nonpositive sectional curvature. 
(ii) The derivative of each exponential map is length increasing, i.e. 


|d exp, (v)3] > [5 


forall p € M andallv,v € T,M. 
(iii) Each exponential map is distance increasing, i.e. 


d(exp, (v0), exp, (v1)) = |vo — v1 
forall p € M andall vo, v, € T,M. 
Moreover, if these equivalent conditions are satisfied, then the exponential map 
exp, : 7,>M — M isa diffeomorphism for every p € M. Thus any two points in M 
are joined by a unique geodesic. 
The proof makes use of the following two exercises. 
Exercise 6.5.3. Let & : [0, 00) — R” be a smooth function such that 
E(0)=0, EO) 40, EG) 40 Vr>0. 


Prove that the function f : [(0,co) > R given by f(t) := |E(t)| is smooth. Hint: 
The function 7 : [0,0o) > R” defined by 


t'é(t), fort > 0, 
&(0), fort = 0, 


n(t) = 
is smooth. Show that f is differentiable and a = |n\'(n, é), 
Exercise 6.5.4 Let € : R — R” be a smooth function such that 
§(0) =0, — §(0) =0. 
Prove that there exist constants e > 0 and c > O such that, for all t € R, 
It<e => EOPIEOP- EO.EO s ele!” 
Hint: Write 
EH=v+n0), EH=v+ AO) 


with n(t) = O(t) and 7(t) = O(t?). Show that the terms of order 2 and 4 cancel 
in the Taylor expansion at tf = 0. 
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Proof of Theorem 6.5.2 We prove that (i) implies (ii). Fix a point p € M and 
two tangent vectors v,v € T,M. Assume without loss of generality that Vv # 0 and 
define the curve y : R — M and the vector field X € Vect(y) along y by 


0 a 
y(t) := exp,(tv), X(t) := as exp,(¢t(v + sv)) € TyyM (6.5.1) 
0 


s= 


fort € R. Then 
X(0) = 0, X(t) = d exp, (tv)tv, VX(0) =v 0. (6.5.2) 

To see this, define the map 8 : R? > M by B(s,t) := exp, (t(v + sv)). It satisfies 
B(O,t) = y(t), 0,8 (0, t) = X(t), B(s, 0) = p, and 0, B(s,0) =v+sV forall s,t € R. 
Hence V X(0) = V;0,8(0, 0) = V;0;8(0,0) = 0. Moreover, the curve (s,-) is a 
geodesic for every s, and hence Lemma 6.1.18 asserts that X = 0,f(0,-) is a Jacobi 
field along y, i.e. 

VVX + R(X, y)y = 0. (6.5.3) 
It follows from Exercise 6.5.3 with &(t) := ®,(0,t)X(¢) that the function 

[0,0c0) > Rit bh |X(1)| 
is smooth and 


Gi _,XOl=IVXO = 


t 


Moreover, for t > 0, we have 


qd? x) = d (X,VX) 
dt? dt |X| 
|\VX|2+(X,VVX) (Xx, VX) 
_ ba (XB (6.5.4) 
[XPIVX)?—(X, VX)? (X, RY, X)y) 
7 [x3 |X| 
> 0. 


Here the third equality follows from the fact that X is a Jacobi field along y, and 
the inequality follows from the nonpositive sectional curvature condition in (i) and 
from the Cauchy—Schwarz inequality. Thus the second derivative of the function 
[0,0o) > R: tts |X(t)|—t|0]| is nonnegative; so its first derivative is nondecreasing 
and it vanishes at f = 0; thus 


|X(0)| — t/o| = 0 
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for every ¢ > 0. In particular, for ¢ = 1 we obtain 
|d exp, (v)d] = |X()| = [I 
as claimed. Thus we have proved that (i) implies (ii). 
We prove that (ii) implies (i). Assume, by contradiction, that (ii) holds but there 
exists a point p € M anda pair of vectors v,v € T,M such that 


(Rp(v,d)v,0) <0. (6.5.5) 


Define y: R— M and X € Vect(y) by (6.5.1) so that (6.5.2) and (6.5.3) are satisfied. 
Thus X is a Jacobi field with 


X0)=0, VX(0)=T#£0. 


Hence it follows from Exercise 6.5.4 with &(t) := ®,(0, t) X(t) that there is a con- 
stant c > 0 such that, for ¢ > O sufficiently small, we have the inequality 


IX@PIVXO? —(XO, VX) < et®. 


Moreover, by (6.5.1) and (6.5.2), lim,\9 y(t) = v and lim,\9 t-! X(t) =. Hence, 
by (6.5.5) there exist constants 6 > 0 and ¢ > 0 such that 


[X= 6t, (X(t), ROD), XP) < -e2?, 


for tf > 0 sufficiently small. By (6.5.4) this implies 


dy _ IXPIVXP—(X, VX! | (XRG. XP) _ et? _ et 


dt? [x3 |x] - 8 38" 


| = 
Integrate this inequality over an interval [0, t] with ct? < ¢6* to obtain 


d d 
qrOl < a a) =| V0) 


t 


Integrating this inequality again gives | X(t)| < t|V X(0)| for small positive t. Hence 
it follows from (6.5.2) that 


|d exp, (tv)td| = |X(2)| < t| VX(0)| = t[0| 


for t > 0 sufficiently small. This contradicts (ii). 

We prove that (ii) implies that the exponential map exp, : 7,M — M isa diffeo- 
morphism for every p € M. By (ii) exp, is a local diffeomorphism, i.e. its derivative 
d exp,(v):7,M > Texp,(v)M is bijective for every v € T, M. Hence we can define 
a Riemannian metric on M’ := T, M by pulling back the metric on M under the 
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exponential map. To make this more explicit we choose a basis e€),...,@m of T7,M 
and define the map y : R” — M by 


W(x) := exp, (>: va) 


i=1 


for x = (x!,...,x’”) € R™. Define the metric tensor by 
ow ow aa 
&ij (Xx) = (Foe. 556)} i,j = 1,...,m. 


Then (IR, g) is a Riemannian manifold (covered by a single coordinate chart) and 
w : (R”, g) ~ M 1s a local isometry, by definition of g. The manifold (R’, g) is 
clearly connected and simply connected. Moreover, for every € = (€!,...,&") € 
R” = T)R”, the curve R > R” :t ++ £é is a geodesic with respect to g (because y 
is a local isometry and the image of the curve under yf is a geodesic in M). Hence 
it follows from Theorem 4.6.5 that (R’”, g) is complete. 

Since both (R’”, g) and M are connected, simply connected, and complete, it 
follows from Corollary 6.1.15 that the local isometry i is bijective. Thus the expo- 
nential map exp, : T,M — M is a diffeomorphism as claimed. It follows that any 
two points in M are joined by a unique geodesic. 

We prove that (ii) implies (iii). Fix a point p € M and two tangent vectors vg, v; € 
T,M. Let y : [0,1] > M be the geodesic with the endpoints 


y(0) = exp,(v9), —-y(1) = exp, (v1) 


and let v : [0, 1] + 7,M be the unique curve satisfying exp, (v(¢)) = y(t) for all ¢. 
Then v(0) = vo, v(1) = vy, and 


d(exp, (vo), €xp,(v1)) = L(y) 
1 


= fie exp, (v(t))¥(t)| dt 


0 
1 
= flewat 
0 
1 


> [ewa 
0 
= |v; — Uo|. 


Here the third inequality follows from (ii). This shows that (ii) implies (iii). 
We prove that (ili) implies (11). Fix a point p € M and a tangent vector v € T,M 
and denote q := exp, (v). By (iii) the exponential map exp, : 7; M — M is injective 
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and, since M is complete, it is bijective (see Theorem 4.6.6). Hence there exists a 
unique geodesic from q to any other point in M and therefore, by Theorem 4.4.4, 
we have 


|w| = d(q, exp, (w)) (6.5.6) 


for every w € T, M . Now define ¢ := exp;' oexp, :T,M — T,M. This map satisfies 
o(v) = 0. Moreover, it is differentiable in a neighborhood of v and, by the chain 
tule, d¢(v) = d exp, (v): T,»M — T,M. Now choose w := $(v +¥) with ¥€ T,M. 
Then exp, (w) = exp, (@(v + dv) = exp,(v + D) and hence it follows from (6.5.6) 
and part (iii) that 


|o(v + 0))| = |w| = dq, exp, (w)) = d(exp,(v), exp,(v + ¥)) = [vl. 
This gives 


mg ve OED On 
|d exp,(v)8| = [dg vy] = tim SO > tim = = [9), 
Thus we have proved that (iii) implies (ii) and this proves Theorem 6.5.2. Oo 


The next lemma establishes a useful inequality for Hadamard manifolds that 
amplifies the expanding property of the exponential map. 


Lemma 6.5.5. Let M be a Hadamard manifold. Fix an element p € M and two 

tangent vectors vo, v1 € T,M. Then, forO<t <T, 

d(exp, (fvo), exp (v1) _ a (exp, (T v0), exp, (701) 
t - T ; 


luo — vj| S (6.5.7) 


Proof The first inequality in (6.5.7) is part (iii) of Theorem 6.5.2. To prove the 
second inequality, assume vg 4 v, and define 


yo(t) := exp, (tvo), yi(t) = exp, (tv1) 


for ¢ € R. For each ¢ € R let the curve [0,1] ~ M: st y(s,t) be the unique 
geodesic with the endpoints y(0,t) = yo(t) and y(1,t) = y, (t). Then 


1 
£0) =dhO.nO) = / lasylds = |asy(s.0)| 
0 


for all s and ¢ and hence 
1 


; Os¥r Vs9sy) 
q- [ee 
. [2.71 


— (9svQ,t), +7, 0)) = (9570, 1), 0:70, t)) 
p(t) , 


(6.5.8) 


Co 
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Since £ (pp) = pp +t p’ and yo and y; are geodesics, this implies 
> a 
pate) + A? = —( (71.1), 8:7C1.1)) — (870.1), 8170.0) 
= v3 sy(1,t), d¢y(1, t)) — (Vids (0, ¢), 0:7 (0, £)) 


E 


(V,0sy, Ory) ds 


a] 


(IV:asyP + (V.Vrd.y. ry) as 


II 
a ee, 


[6 IVsdey? + (Rey. Bry) y. Bry) ds 
0 
p(t 


IV 


Here the last step follows from (6.5.8), the Cauchy—Schwarz inequality, and the 
nonpositive sectional curvature assumption. Thus p : [0, 7] > R is a convex func- 
tion satisfying p(0) = 0 and hence p(t) < to(T)/T for 0 <t < T. This proves 
Lemma 6.5.5. O 


6.5.2 Cartan’s Fixed Point Theorem 


Recall from Definition 6.5.1 that a Hadamard manifold is a nonempty, connected, 
simply connected, complete Riemannian manifold of nonpositive sectional curva- 
ture. 


Theorem 6.5.6 (Cartan) Let M be a Hadamard manifold and let G be a compact 
topological group that acts on M by isometries. Then there exists a point p © M 
such that gp = p for every g €G. 


The proof follows the argument given by Bill Casselmann in [16] and requires 
the following two lemmas. The first lemma asserts that every complete, connected, 
simply connected Riemannian manifold of nonpositive sectional curvature is a 
semi-hyperbolic space in the sense of Alexandrov [3]. 


Lemma 6.5.7 (Alexandrov) Let M be a Hadamard manifold, let m € M and 
v € T,,M, and define 


Po:= exp,,(—v), P= €XP,,(V). 
Then 


2d(m,q)° + 


d : 2 
CP Pi < d(po.q)’ + d(pi.q) (6.5.9) 


for every q € M (see Fig. 6.5). 
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Fig. 6.5 Alexandrov semi- Po Py 
hyperbolic space 


Proof By Theorem 6.5.2 the exponential map exp,, : TM — M is a diffeomor- 
phism. Hence d(po, pi) = 2|v|. Now let g € M. Then there is a unique tangent 
vector w € T;,M such that 


q =exp,,(w), — d(m,q) = |w]. 


Since the exponential map is expanding, by Theorem 6.5.2, we have 


d(po.g)2=|wt+v|,  — d(pi.q) 2 |w- 0]. 
Hence 
d(m,qy” = |w|? 

_ lwtoP+|w-vpP me 

7 2 

— A(P0.9)" + a(Pi.g? _ d(Po. PAY 

= 2 4 : 
This proves Lemma 6.5.7. O 


Exercise 6.5.8 Equality holds in (6.5.9) whenever M is flat. 


The next lemma is Serre’s Uniqueness Theorem for the circumcentre of a 
bounded set in a semi-hyperbolic space. 


Lemma 6.5.9 (Serre) Let M be a Hadamard manifold and, for p € M andr = 0, 
denote by B(p,r) C M the closed ball of radius r centered at p. Let 2 C M bea 
nonempty bounded set and define 


rg := inf{r > 0| there exists a p € M such that 2 C B(p,r)}. 


Then there exists a unique point pa € M such that 2 C B(pa,ra) (see Fig. 6.6). 
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Fig.6.6 The circumcenter of 
a bounded set NN 


Proof We prove existence. Choose sequences r; > rg and p; € M such that 
QC B(pi.Ti). lim r; = re. 
1—>>0Co 


Choose g € 2. Then d(q, p;) < 7; for every 7. Since the sequence 7; is bounded 
and M is complete, it follows that p; has a convergent subsequence, still denoted 
by p;. Its limit 


Po := lim p; 


satisfies 2 C B(pa,ra). 
We prove uniqueness. Let po, p1 € M such that 


2 C B(po.re) 1 B(pi. ra). 


Since the exponential map exp, : 7,M — M is a diffeomorphism (by Theo- 
rem 6.5.2), there exists a unique vector vp € T,,M such that p; = exp,, (vo). 
Denote the midpoint between pop and p; by 


mi= exp, (5U0)- 
Then it follows from Lemma 6.5.7 that 


d(po.4) +d(pi.qy — d(Po, pi)” 
2 4 
2 
ei Mo Pv) 


d(m,q) < 


for every q € Q. Since sup,cg d(m, q) = re (by definition of rg), it follows that 
d(po, pi) = 0 and hence po = pi. This proves Lemma 6.5.9. oO 


Proof of Theorem 6.5.6 Letq € M and consider the group orbit 


2 := {gq|g eG}. 
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Since G is compact, this set is bounded. Let rg and pg be as in Lemma 6.5.9. Then 
2 Cc B(pa,ra). Since G acts on M by isometries, this implies 


2 = g2C B(gpa,re) 


for all g € G. Hence it follows from the uniqueness statement in Lemma 6.5.9 that 
&Pa = Pa for every g € G. This proves Theorem 6.5.6. O 


6.5.3 Positive Definite Symmetric Matrices 


We close this section with an example of a nonpositive sectional curvature man- 
ifold which plays a key role in Donaldson’s approach to Lie algebra theory [17] 
(see Sect. 7.5.2). Let m be a positive integer and consider the space 


P= P(R”) := \P « Rm | p'=Ps o} (6.5.10) 


of positive definite symmetric m x m-matrices. (Here the notation ““P > 0” means 
(x, Px) > 0 for every nonzero vector x € R’”.) Thus ? is an open subset of the 
vector space $:= {5 €R”*’" | S' = S} of symmetric matrices and hence the tangent 
space of P is Tp P = S for every P € P. However, we do not use the metric inherited 
from the inclusion into S but define a Riemannian metric by 


n~ 


(P,, Po)p := trace(P; P~'P, P-') (6.5.11) 
for P € P and Pi Ps € TpP =S8. 
Theorem 6.5.10 The space P with the Riemannian metric (6.5.11) is a connected, 


simply connected, complete Riemannian manifold with nonpositive sectional cur- 
vature, and the distance function on P is given by 


d(P,Q) = yrace((log(P-20P-"/2))") (6.5.12) 


for P,Q €P. Moreover, P is a symmetric space and the group GL(m, R) of non- 
singular m x m-matrices acts on P by isometries via P +> gP.g' for g € GL(m,R). 


Proof See below. Oo 


Remark 6.5.11 Let V be an m-dimensional vector space and H C S*V* be the 
set of inner products on V. Define a Riemannian metric on FH by 


(hy, ho), 1= trace(S{S2), AC, S;-) = hi, (6.5.13) 
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for h € H and hy, ho € T,H = S*V*. Then every vector space isomorphism 
a: R” — V determines a diffeomorphism $y : H — P via 


gu(h) = P = h(wé,an) = (&, P~'n) pm. (6.5.14) 
The derivative of ¢, at / in the direction he T;, H is given by 
do,(h)h = P = PP =-a"'Sa, hO,S:):=h. (6.5.15) 


Thus ¢, is an isometry with respect to the Riemannian metrics (6.5.13) on HH 
and (6.5.11) on P. The ¢, form an atlas on HC with the transition maps @go(P) := 
bp 0b, (P) = SpaP hq where gpa := B-'a € GL(m, R). 


Remark 6.5.12 The submanifold 

Po := Po(R”) := {P € P| det(P) = 1} (6.5.16) 
of positive definite symmetric m x m-matrices with determinant one is totally geo- 
desic (see Remark 6.5.13 below). Hence all the assertions of Theorem 6.5.10 (with 


GL(m, R) replaced by SL(m, R)) remain valid for Po. 


Remark 6.5.13 Let VM be a Riemannian manifold and L C M be a submanifold. 
Then the following are equivalent. 


(i) Ify: 1 — M isa geodesic on an open interval J such that 0 € J and 
yO)eL, 0) € ToL, 
then there is a constant e > 0 such that y(t) € L for |t| < e. 
Gi) Ify: 7 — Lis asmooth curve on an open interval J and ®, denotes parallel 
transport along y in M, then 


P,(t,5)Tyis)L = TyayL Vs,tel. 


(iii) If y : J — L is a smooth curve on an open interval J and X € Vect(y) is a 
vector field along y (with values in 7M), then 


X*HEeTwybL Vitel => VX) ETL Vtel. 
A submanifold that satisfies these equivalent conditions is called totally geodesic. 
Exercise 6.5.14 Prove the equivalence of (1), (ii), (iii) in Remark 6.5.13. Hint: 


Choose suitable coordinates and translate each of the three assertions into condi- 
tions on the Christoffel symbols. 
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Exercise 6.5.15 Prove that Po is a totally geodesic submanifold of . Prove that 
P is diffeomorphic to the quotient GL(m,R)/O(m) via polar decomposition and 
that Po is diffeomorphic to the quotient SL(m,R)/SO(m). Hint: Consider the 
map GL(m,R) > P: gh Vgg". 


Exercise 6.5.16 In the case m = 2 prove that Po is isometric to the hyperbolic 
space HI. 


The proof of Theorem 6.5.10 is based on the calculation of the Levi-Civita con- 


nection and the formulas for geodesics and the Riemann curvature tensor in the 
following three lemmas. 


Lemma 6.5.17 Let I > P: tt» P(t) bea smooth path in P onan interval I C R 


and let 1 > 8: t +» S(t) be a vector field along P. Then the covariant derivative 
of S is given by 


. ff Aes 
VS= aa ie te 5PP Us. (6.5.17) 


Proof The formula (6.5.17) determines a family of linear operators on the spaces 
of vector fields along paths that satisfy the torsion-free condition 


V;0;P = Vi0sP 
for every smooth map R? > ? : (s,t) + P(s,t) and the Leibniz rule 
V(S1, S2)p = (VS1, S2) p + (Si, VS2) p 
for any two vector fields S; and Sj along P. These two conditions determine the 


covariant derivative uniquely (see Lemma 3.6.5 and Theorem 3.7.8). This proves 
Lemma 6.5.17. O 


Lemma 6.5.18 The geodesics in P are given by 


y(t) =P exp(tP-'P) 
= exp(tP P') P (6.5.18) 
=P app“? PP?) Pp? 


for P €P, Pe Tp? = §, andt € R. In particular, P is complete. 
Proof Thecurve y : R — P defined by (6.5.18) satisfies 


y(t) = P exp(tP~'P) a PP'y(t). 
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Hence it follows from Lemma 6.5.17 that 
Vy(t) =7O -—yOrva yO 
= p(t)- PP y(t) 
=0 


for every t € R. Hence y is a geodesic. Since the curve y : R > P in (6.5.18) satisfies 
y(0) = P and y(0) = P, this proves Lemma 6.5.18. O 


Lemma 6.5.19 For P € ?, S,T,A €& the curvature tensor on P is 


1 1 
Rp(S,T)A = ree eee = gAPUTP 'S 


1 1 
+ qiP SPA + gAP USP IT (6.5.19) 


- —F[sP- PP), APP. 


Proof Choose smooth maps P : R? — ? and A: R? > S and define S := 0,P 
and T := 0,;P. Then 0,T = 0;S and Rp(S,T)A = V,V;A — V;V;A. Moreover, 
by Lemma 6.5.17 we have V,A = 0,4 — AP'S — 4SP~'A and V,A = 0,A — 
SAP-'T —4TP~'A. Hence 


1 1 
Rp(S,T)A = 4,V,A— 5(ViA)P tS 2 55P VA) 


1 i 
—VsA + 5(VsA)P IT + 5TP'(V,A) 
1 1 
= a,(a,4 ee) ae ee =TP"'A) 
2 2 
1 1 —1 1 —l —1 
= 5 (a4 = de rir A)P Ss 
1 1 1 
= =SP-1(a,A a 2Ap lp = =TP-'A) 
2 2 2 
je ee. Eee 
= 3,(a.4 = APS = SP 4) 


1 1 1 
=(§ A= AP'S —_ SP Ale oT 
+5 (84-5 35? |) 


1 1 1 
-~TP'(0,A—=AP'S —=SP7'A). 
wT (a. 2 5 3° ) 
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A term by term inspection shows that the partial derivatives of A, S, T cancel 
because 0,7 = 0;S. Hence we are left with the drivatives of P, so 


Rp(S,T)A 

= SAP '(@,P)PIT + STP \(@,P)PA 

+ ZAPITP'S = aTP AP's - ZSPAP“T + ZSP"TPOA 
= 5AP“'(0,P)P-'S = SSP OPP A 

_ 7AP“SP“T — {SP 4P“T - GTP APs _ joPasP A, 


Insert 0, P = S, 0,P = T to obtain (6.5.19). This proves Lemma 6.5.19. O 


Proof of Theorem 6.5.10 The manifold P is obviously connected and simply 
connected as it is a convex open subset of a finite-dimensional vector space. That 
the map GL(m, R) x P > P: (g, P) + gPg' defines a group action of GL(m, R) 
on P by isometries follows directly from the definitions. The remaining assertions 
will be proved in three steps. 


Step 1 The manifold P has nonpositive sectional curvature. 


By Lemma 6.5.19 with A = T and equation (6.5.11) we have 
(S, Rp(S,T)T)p 
= trace(SP~'(Rp(S,T)T)P~') 


= —Ftrave(SP [SP EP TP ‘}) 


1 I 
= 5trace(SP "TP “SP TP ') = ztrace(SP ‘TP TP SP) 


= strace(X”) — strace(XTX), 


Cauchy—Schwarz inequality, we have 
trace(X7) = So xy Xi < so = trace(X'X). 
ij ij 
Thus (S, Rp(S,T)T)p <0 forall P € P and S,T € S. This proves Step 1. 


Step 2 P is a symmetric space. 


Fix an element A € P and define the map ¢: P > P by o(P) := APA 
for P € P. This map is a diffeomorphism, fixes the matrix A = ¢(A), and its 
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derivative at P € P is given by do(P)P a —AP"'P PA for P € Tp. Hence 
do(A) = —id. Moreover, (d¢(P)P)¢(P)~! = —AP™' PA™ and so 


|do(P)Plicp) = trace((AP~'P A~!)’) = trace((P~'P)’) = | P|, 
for all P € Pand P € Tp. Thus ¢ is an isometry and this proves Step 2. 
Step 3 The distance function on P is given by (6.5.12). 


Let P, Q € P. Then, since P is a Hadamard manifold by Step 1 and Lemma 6.5.18, 
there exists a unique matrix P € 8 such that eXPp (P) = = Q. By Lemma 6.5.18 this 
equation reads P!/? exp(P~!/2P P-'/2) P!/2 = Q. Thus S := Po!/?P p-l/? = 
log(P~'/?QP-"/?) and 


d(P, OY = |P|2 = trace(P P~! P P- *) = trace(S7), 


This proves Step 3 and Theorem 6.5.10. Oo 


Remark 6.5.20 Theorem 6.5.10 carries over to the complex setting as follows. 
Replace P by the space 


SSE" |) 0"=0>0} (6.5.20) 


of positive definite Hermitian matrices. Here Q* denotes the conjugate transposed 
matrix of 0 € C’”"*” and the notation “QO > 0” means z* Oz > 0 for every nonzero 
vector z € C”. Thus Q is an open subset of the vector space of Hermitian m x m- 
matrices. Define the Riemannian metric on Q by 


(H,, Ho) 9 = Re(trace(H, Q~' H,Q™')) 


for Q € Q and Hy, Hy € TgQ. Then all the assertions of Theorem 6.5.10 (with 
GL(m, R) replaced by GL(m, C)) carry over to Q. The proof is verbatim the same, 
with the transposed matric replaced by the conjugate tramsposed matrix and the 
trace replaced by the real part of the trace. 


Remark 6.5.21 The set 29 := {Q € Q| det(Q) = 1} of positive definite Hermi- 
tian matrices with determinant one is a totally geodesic submanifold of 9. Hence 
all the assertions of Theorem 6.5.10 (with GL(m, R) replaced by SL(m, C)) remain 
valid for Qo. 


Exercise 6.5.22 Show that Theorem 6.5.10 remains valid for Q, and Qp is a to- 
tally geodesic submanifold of Q. Prove that 9 is diffeomorphic to the quotient 
GL(m,C)/U(m) and Qp is diffeomorphic to SL(m,C)/SU(m). Hint: Con- 
sider the map SL(m,C) > 99: gh /gg*. Show that the pullback metric 
on SL(m, C)/SU(m) is given by the norm of the Hermitian part of the matrix g~!¢ 
for g € T,SL(m, C). 
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Exercise 6.5.23 In the case m = 2 prove that Qo is isometric to the hyperbolic 
space H?. 


The space 09 = SL(m, C)/SU(m) (with nonpositive sectional curvature) can be 
viewed as a kind of dual space of the Lie group SU(m) (with nonnegative sectional 
curvature). They have the same dimension and in both cases the Riemann curvature 
tensor is given by the Lie bracket (see equation (5.2.23) for SU(m) and equa- 
tion (6.5.19) for Qo). One can think of the noncompact Lie group GS := SL(m, C) 
as the complexification of the compact Lie group G := SU(m). It can be written in 
the form 


G° = {exp(in)u|u € G,n € g}, (6.5.21) 
the Lie algebra of G° is the complexification g° = g @ ig of the Lie algebra of G, 
and the quotient G°/G is a Hadamard manifold. These observations carry over to 
all Lie subgroups G C SU(m). For an exposition see [20]. 


Exercise 6.5.24 (Siegel upper half space) 


(i) The standard symplectic form wp on R2” = R” x R” is given by 
0 -l 


for z,¢ € R” and the space of wy-compatible linear complex structures is 
the (n? + n)-dimensional manifold 


J? =-1, JJo+ JoJ' =0 
R”, wo) = 4 J € RP ini te (65.22 
ey w(t, Jb) > Oforo Ate R2(° ©>-22) 
Define a Riemannian metric on J(M, wo) by 
(F 1,42) := trace(T 1 F2) (6.5.23) 


for Jigs € T;J(R2",@o). Show that the symplectic linear group Sp(2n) 
(Exercise 2.5.5) acts on the space J(R*",@o) by isometries J +> gJg™!. 
If J € Jo(R*",@o) and P := —JJy = P', show that wo(-, J-) = (-, P7!-). 
Show that the map J(R?",@ ) > Po(R2”) : J t —JJy is an Sp(2n)- 
equivariant isometric embedding, whose image is a totally geodesic sub- 
manifold of Po(R*”"). Deduce that 7(R2”, wp) is a Hadamard manifold and a 
symmetric space. For every J € J(R*”, wo) show that the map J’ —JJ/J 
is an isometry fixing J whose derivative at J is —id. 
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(i) Siegel upper half space is the manifold S, C C”*” of symmetric complex 
n X n-matrices with positive definite imaginary part [71]. The symplectic lin- 
ear group Sp(27) acts on this space via 


A B 
wZ = (AZ + BY(CZ+4+ Dy), = , 
g (AZ + B)\(CZ + D) g é ) 


A'C=C'A, B'D=D'B, A'D—C'B=1, 


(6.5.24) 


for g € Sp(2n) and Z € S,,. Show that this is a well-defined group action. 
(For hints see [49, page 72].) Show that there is a unique Sp(27)-equivariant 
diffeomorphism from S,, to J (R?”, @o) that sends il to Jo. Show that this map 
is given by the explicit formula 


xy-! -y—xy-'x 
HAy= ( y-! ayy 


€ J(R”",a), Z=X+iY €S). 


Show that the diffeomorphism S, — 7(R2",@o): Z +> J(Z) is an isometry 
with respect to the Riemannian metric on S,, given by 


|Z|2, = 2trace((Y-'X)? + (Y-¥)?) (6.5.25) 


forZ =X +i¥ €S,andZ =X +iY €TzS,. 


6.6 Positive Ricci Curvature* 


In this section we prove that every complete connected manifold M Cc R” whose 
Ricci curvature satisfies a uniform positive lower bound is necessarily compact. If 
the sectional curvature is constant and positive, this follows from Corollary 6.4.12 
as was noted in Example 6.4.16. 


Definition 6.6.1 (Ricci tensor) Let M@ C R” be an m-dimensional submanifold 
and fix an element p € M. The Ricci tensor of M at p is the symmetric bilinear 
form 


Ric, :T,M xT,M —R 


defined by 


m 


Ricy (u,v) := > (Rp(ej,u)v.e;), (6.6.1) 


i=1 


where €1,...,@m iS an orthonormal basis of 7, M. The Ricci tensor is independent 
of the choice of this orthonormal frame and is symmetric by equations (5.2.17) 
and (5.2.19) in Theorem 5.2.14. 
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6.6.1. The Ricci Tensor in Local Coordinates 


Let 6: U > 2 be a local coordinate chart on an open set U C M with values in 
an open set 2 C R’”, denote its inverse by wy := ¢ 7! : 2 > U, and let 


3 
E,(x):= va) ETyyM, x1eEQ, i=1,...,m, 


be the local frame of the tangent bundle determined by this coordinate chart. Denote 
the coefficients of the first fundamental form by g;; := (E;, E;) and the coefficients 
of the Riemann curvature tensor by RF , . & — Rs0 that 


R(E;, Ej )Ex = Yo Rix Ee. 
e 
(see Section 5.4). Then 


Ric;; := Ric(E;, E;) = > 5 = » Riga”. (6.6.2) 


v=l Bbv=l 


(Exercise: Prove this.) 


6.6.2 The Bonnet-Myers Theorem 


Theorem 6.6.2 (Bonnet-Myers) Let M C R" be acomplete, connected manifold 
of dimension m > 2 and suppose that there exists a 5 > 0 such that 


Ric,(v, v) > (m — 1)6|v|? (6.6.3) 


for every p € M andeveryv € T,M. Then d(p,q) < x//6 for all p,q € M and 
hence M is compact. 


The proof is based on the following lemma. 


Lemma 6.6.3 Let R x [0,1] > M : (s,t) % ys(t) be a smooth map such that 
y:= yo: [0,1] — M isa geodesic and y;(0) = y(0) and ys(1) = y(1) forall s € R. 


Define the vector field X along y by X(t) := Z| _o¥s (t) forO <t < 1. Then 


i 


2: 
EGY == / (VVX + R(X, Pp, X) dt 
=0 


ds? 


Ss 


(6.6.4) 


1 
= f(vxr — (R(X. p)y, x)) dt. 
0 
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Proof In the proof of Theorem 4.1.4 we have seen that 


1 


d 
SE (is) == f (Vs dirs(t) ders(t) at 
0 
for all s € R (see equation (4.1.10)). Differentiate this equation again with respect 
to s and use the identity V,d, = V;,0, to obtain 


2 
d 
72 Es) aa 


1 
[0a OsYs) dt 
s=0 
0 


if 


(Vs Vi0rYs5 Os Vs) arf Vi01Vs, Vs Os Vs) d 
0 


1 
| 
1 
== | (V,9B.r. + ROY. 8:7.)BrP4 Beye) at 
0 
1 
— f (81%. Yoder.) dt 

0 

Now take s = 0. Then V;0;y = 0 because y is a geodesic and hence 


1 


Fa) _ EO.) =~ [Wik + RO DP. X) at 
s=0 


2 


This proves the first equality in (6.6.4). To prove the second equality in (6.6.4) 
use integration by parts and the fact that X(0) = 0 and X(1) = 0. This proves 
Lemma 6.6.3. Oo 


Proof of Theorem 6.6.2. Let p,q € M. By the Hopf—Rinow Theorem 4.6.6 there 
exists a geodesic y : [0,1] — M such that 


y(0) = p, y() =4q, L(y) = d(p,q). 


Let X € Vect(y) be a vector field along y such that X(0) = 0 and X(1) = 0 
and define y;(t) := exp,;)(sX(t)) for s¢ R and 0 <¢ <1. Then s = 0 is the 
absolute minimum of the function R > R: s+ E(y;) by Lemma 4.4.1. Hence 
ae |;-o£ (ys) = 0 and by Lemma 6.6.3 this implies 


1 


1 
[tx yy, X) dt < [ivxorar (6.6.5) 
0 


0 
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Now assume p # g and choose an orthonormal frame £),...,£,, along y such 
that FE; = y/|y| and V,£E; = 0 fori = 1,...m. Define 


X(t) := sin(st) E; (t) 


fori = 1,...mand0 <t < 1. Then |V,X;(¢)| = a cos(zf) for alli and ft and 


m 


5(m — N|YO)|? < Ries (VO. 7O) = (REO. YOO, £0) 


i=2 


for 0 < t < 1. Multiply this inequality by sin’(st), integrate over the unit inter- 
val, and use the identities |y(t)| = d(p,q) and i. sin?(2t) dt = 1/2 to obtain the 
estimate 


6(m — 1) 


1 
A d(p.qy = [d0n=v sinter ive at 


0 
m 1 
< / (R(X). HO) PO. Xi) dt 
i=2 0 
m 1 
< [ivxorar 
i=2 0 


1 


(m—1) | 2? cos*(xt) dt 
! 


m*(m — 1) 
5 : 


Here the third step uses (6.6.5). Since m > 2 it follows from this estimate 
that d(p,q)* < 27/6 and this proves Theorem 6.6.2. oO 


A direct consequence of Theorem 6.6.2 is that every compact manifold with 
positive Ricci curvature has a compact universal cover and hence has a finite fun- 
damental group. 


6.6.3 Positive Sectional Curvature 


Corollary 6.6.4 Let M C R” be a complete, connected manifold of dimen- 
sion m > 2 and suppose that there exists a 5 > 0 such that 


K(p, E) > 5 
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for every p © M and every 2-dimensional linear subspace E C T,M. Then 
a 
d(p.q) S$ = 
V5 
for all p,q € M and hence M is compact. 


Proof The condition K > 6 implies (6.6.3) with m := dim(M) and hence the as- 
sertion follows from Theorem 6.6.2. This proves Corollary 6.6.4. O 


The example of the m-sphere shows that the estimate in Corollary 6.6.4 is sharp. 
Namely, M := S” has sectional curvature K = | and diameter z. 

The paraboloid M := (x, y,z) € R?|z = x* + y} has positive GauBian cur- 
vature and so positive Ricci curvature (Lemma 6.7.2) but is noncompact. 


Remark 6.6.5 (Sphere Theorem) The Topological Sphere Theorem asserts that 
every complete, connected, simply connected Riemannian m-manifold M whose 
sectional curvature satisfies the estimate 


1/4 < K(p,E) <1 


for every p € M and every 2-dimensional linear subspace E C 7,M must be 
homeomorphic to the m-sphere. The Differentiable Sphere Theorem asserts 
under the same assumptions that M is diffeomorphic to S’”. 

The problem goes back to a question posed by Heinz Hopf [31, 32] in the 1920s. 
After intermediate results by Rauch [57] (with 1/4 replaced by 3/4) and others, 
the Toplogical Sphere Theorem was proved in 1961 by Berger [6] and Klingen- 
berg [41]. The Differentiable Sphere Theorem was proved in 2007 by Brendle 
and Schoen [8-10]. They even weakened the assumption to 0 < max; K(p, E) < 
4ming K(p, E) for all p € M, where the maximum and minimum are taken over 
all 2-dimensional linear subspaces E C T,M. 

The Topological Sphere Theorem is sharp, as the suitably scaled Fubini—Study 
metric on complex projective space satisfies 1/4 < K(p, E) <1 for all p and E. 
The Differentiable Sphere Theorem is a significant improvement, because in many 
dimensions there exist smooth m-manifolds that are homeomorphic, but not diffeo- 
morphic, to S’”’. These are the so-called exotic spheres and many of those do not 
even admit metrics of positive scalar curvature [30]. (For the definition of scalar 
curvature see Sect. 6.7.) 


6.7. Scalar Curvature* 


This section introduces the scalar curvature and explains how it is related to the 
Ricci tensor and the Riemann curvature tensor in dimensions 2 and 3. The sec- 
tion also includes a brief discussion of several problems in differential geometry in 
which the scalar curvature plays a central role. 
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6.7.1. Definition and Basic Properties 


Let m be a positive integer and let M C R” be an m-manifold. For each ele- 
ment p € M denote by Rp: 7,M x 7,M — End(7,M) the Riemann curvature 
tensor and by Ric, : 7,M x T,M — R the Ricci tensor of M at p. 


Definition 6.7.1 (Scalar curvature) Fix an element p € M. The scalar curva- 
ture S(p) of M at p is the trace of the Ricci tensor and is given by 


m 


S(p) = D-Ricp(ei.e;) = D> (Role. e;)e;. ei), (6.7.1) 


i=l ij=l 


where é,...,@m is an orthonormal basis of 7; M. The scalar curvature is indepen- 
dent of the choice of this orthonormal frame. 


Lemma 6.7.2. Assume m = 2 and let K : M — R be the Gaufian curvature of M 
in (6.4.2). Then 


S(p) = 2K(p), Ric,(u, v) = K(p)(u, v), (6.7.2) 
(R,(u, v)w,z) = K(p)((u.z)(v, w) — (u, w)(v,2)) (6.7.3) 


forall p € M andu,v,w,z €T,M. 


Proof By definition, the Gaufian curvature is given by 


(R,(u, v)v, u) 


KW) = TPlyP— (vp 


(6.7.4) 


for every pair of linearly independent tangent vectors u,v € T,M. Take u to be a 
unit vector orthogonal to v to obtain Ric,(v, v) = K(p)|v|? for every v € T,M. 
Since Ric, : 7,M x T,M — R is a symmetric bilinear form, this implies the sec- 
ond equality in (6.7.2). With this understood the first equality in (6.7.2) follows 
directly from the definition of the scalar curvature in (6.7.1). Equation (6.7.3) fol- 
lows from (6.7.4) by Theorem 6.4.8. This proves Lemma 6.7.2. Ey 


Lemma 6.7.3 Assume m = 3. Then 


(R,(u, v)w, Zz) = Ric,(v, w)(u, z) — Ric,(u, w)(v, z) 


+ Ricp(u, 2)(v, w) — Ricp(v, z)(u, w) (6.7.5) 


= “Pu, 2) 0, w) — (u, w)(v, 2)) 


forall p € M andallu,v,w,z €T,M. 
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Proof Choose an orthonormal basis e1, e2, e3 of the tangent space 7, M and define 
the endomorphism 


Q,:1,M —T,M 
by 
Opu i= S> Rou, ee 


for u € T,M. The right hand side of this equation is independent of the choice of 
the orthonormal basis and the endomorphism Q,, satisfies 


trace(Q,) = S(p) 


and 


(Q,u,v) = Ric,(u, v) 


for all u,v € 7,M. With this notation equation (6.7.5) takes the form 


R,(u, v)w = Ricp(v, w)u + (v, u)(Q, ),) 
, (6.7.6) 
— Ric, (u, w)v — (u, u)(Q,0 = “), ) 


It suffices to verify equation (6.7.6) in the following three cases. 


(a) u,v are linarly dependent. 
(b) u,v, w are orthonormal. 
(c) u,v are orthonormal and w = v. 


In the case (a) both sides of equation (6.7.6) vanish. In the case (b) we have 
R,(u,v)w = (R,(u, v)w,uju + (R,(u, v)w, v)v 
= Ric,(v, w)u — Ric, (u, w)v, 
and this is equivalent to (6.7.6). 


In the case (c) equation (6.7.6) takes the form 


R,(u, v)v = Ric,(v, v)u — Ric,(u, v)v + Qpu — oP, (6.7.7) 


To verify this formula, choose a unit vector w that is orthogonal to u and v. Then it 
follows from the definition of S(p) and Q, that 


S@) 


7 (Rp(u, v)v,u) + (Rp(w, v)u, w) + (Rp(u, w)w, u) 


= Ric,(v, v) + (Rp(u, w)w, u), 
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and 
QO,u = R,(u,v)v + R,(u, w)w 
= R,(u,v)v + (Rp(u, w)w, v)v + (Rp(u, w)w,u)u 
S 
= R,(u, v)v + Ric,(u, v)v + Py — Ric,(v, v)u. 
This proves (6.7.7) and Lemma 6.7.3. Oo 


6.7.2. Scalar Curvature in Local Coordinates 


Let ¢: U > 2 be a local coordinate chart on an open set U C M with values in 
an open set {2 C IR”, denote its inverse by 


w:=o':2 50, 
and denote by 
E; (x) = 0; W(x) E Ty(x)M 


for x € 2 and i = 1,...,m the local frame of the tangent bundle determined 
by this coordinate chart. Denote the coefficients of the first fundamental form 
by gi; := (£;, E;), of the Ricci tensor by 


Ric;; = Ric(E; , E; ), 


and of the Riemann curvature tensor by Rjj;x¢ and R°,, so that 


ijk 


Rijxe = (R(Ei, Ej)Ex, Ee), R(Ey, Ej)Ex = D> Ri Ee. 
f=1 


Then the scalar curvature is the function S : 2 > R given by 
m 


S= >)" Rige’ = ) Rie’ = +) Rye”. (6.7.8) 


ij=l ijv=l ijkl=1 


m 


(Exercise: Prove this.) 


6.7.3 Positive Scalar Curvature 


An important question for a compact smooth manifold M is whether or not it admits 
a Riemannian metric of positive scalar curvature. A theorem of Lichnerowicz [46] 
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asserts that, if M is a compact spin manifold of dimension m = 4n that admits a 
Riemannian metric of positive scalar curvature, then a certain characteristic class of 
this manifold (the A-genus) must vanish. The definitions of the terms that appear in 
this sentence (spin structure and A-genus) as well as in the proof, which involves 
the Dirac operator, the Atiyah—Singer index theorem, and the Weitzenbéck formula, 
go beyond the scope of the present book. For an exposition see [65, Theorem 6.30]. 

A nonlinear variant of Lichnerowicz’ theorem asserts that a compact oriented 
smooth 4-manifold with by — b, odd and by > 1 that admits a Riemannian met- 
ric of positive scalar curvature has vanishing Seiberg-Witten invariants (see [65, 
Proposition 7.32]). 

In another direction Gromov and Lawson [22] proved that if M; and M) are two 
compact manifolds of dimension m > 3 that admit Riemannian metrics of positive 
scalar curvature, then so does their connected sum M,#M)p (see also [65, Theo- 
rem 2.18]). 

In the late 1970’s Schoen and Yau [70] proved, using minimal surfaces, that the 
torus T” = R”/Z'” does not admit a metric of positive scalar curvature form < 7. 
We remark that the A-genus of the torus vanishes and so Lichnerowicz’ theorem 
does not apply. In [22] Gromov and Lawson refined the techniques of Lichnerow- 
icz to prove that, for any m, the m-torus does not admit a metric of positive scalar 
curvature. In fact, they proved that for any compact spin manifold M of dimen- 
sion m the connected sum N := M#T” does not admit a metric of positive scalar 
curvature. Moreover, they proved that if N admits a metric of nonnegative scalar 
curvature, then this metric must be flat and N must be the standard m-torus. 


6.7.4 Constant Scalar Curvature 


An interesting class of Riemannian metrics consists of those that have constant 
scalar curvature. In dimension m = 2 the existence of such a metric is the content 
of the uniformisation theorem. The proof involves the solution of the Kazdan- 
Warner equation and goes beyond the scope of this book. For an exposition see [65, 
Theorem 2.20 and Theorem D.1]. 

In dimension two Lemma 6.7.2 shows that the constant scalar curvature condi- 
tion is equivalent to constant sectional curvature. However, in higher dimensions the 
constant scalar curvature condition is more general. By Corollary 6.4.12 every com- 
pact simply connected m-manifold with constant sectional curvature is diffeomor- 
phic to the m-sphere, while constant scalar curvature metrics exist on every compact 
manifold. Examples are the Fubini-Study metric on complex projective space (Ex- 
ample 2.8.5 and Example 3.7.5), locally symmetric spaces (Theorem 6.3.4), and 
products of Riemannian manifolds with constant scalar curvature. 


Definition 6.7.4 Let M be a Riemannian manifold with the metric g. A Rieman- 
nian metric g’ on M is called conformally equivalent to g iff there exists a smooth 
function A : M — (0,00) such that g’ = Ag. The set of all such Riemannian metrics 
is called the conformal class of g. 
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Remark 6.7.5 (Yamabe problem) The Yamabe problem asserts that the con- 
formal class of every Riemannian metric on a compact m-manifold M of dimen- 
sion m = 2 contains a metric of constant scalar curvature. 

This problem was formulated in 1960 by Hidehiko Yamabe and was eventu- 
ally settled in the affirmative in 1984 by the combined work of Hidehiko Yam- 
abe [78], Thierry Aubin [5], Neil Trudinger [75], and Richard Schoen [69]. The 
proof for a compact manifold M of dimension m > 2 relies on finding a positive 
function f : M — R and areal number c that satisfy the Yamabe equation 

4(m Y Ay ate SF = epee (6.7.9) 
m—2 
Here S, : M — R denotes the scalar curvature of the Riemannian metric g and A, 
denotes its Laplace—Beltrami operator. In local coordinates this operator is given 
by the formula 


(6.7.10) 


1 0. a 
Ag = —— Vi gil Vaet(g)——. 
- /det(g) dX ox! e et(g) axs 


If f : M — (0,00) is a positive solution of (6.7.9), then the Riemannian met- 
ric f4/“"-) g has the constant scalar curvature c. Exercise: Prove this. Hint: Show 
first that 


S,2 


wg =U Se + 2(m— lu Agu — (m — 1)(m — 4)u* | dul. (6.7.11) 
Then take f := u”/?~! and use the identities 


|du"|2 = nu"? |dul?, (6.7.12) 


Agu” = nu" Agu +n(n—1)u"|dul? (6.7.13) 
for a smooth function vu : M — (0,00) and areal number n > 0. 


Examples of constant scalar curvature metrics arise from the Einstein condition 
(see Lemma 6.7.7 below). 


Definition 6.7.6 (Einstein metric) A Riemannian manifold M is called an Ein- 
stein manifold iff its Ricci tensor is a scalar multiple of the first fundamental form, 
i.e. there exists a smooth function 1 : M > R 


Ric, (u,v) = A(p){u, v) (6.7.14) 


for all p € M andallu,v € T,M. It follows from the definitions that the factor A 
in (6.7.14) is related to the scalar curvature S by 


A=". (6.7.15) 
m 


Lemma 6.7.2 shows that every Riemannian metric on a 2-manifold is an Einstein 
metric and the factor A = K is the GauBian curvature. 
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Lemma 6.7.7 Let M be an Einstein manifold of dimension m = 3. Then the scalar 
curvature of M is locally constant. 


Proof Let p € M. Then there exists a local orthonormal frame of the tangent 
bundle F,,..., Em € Vect(U) in a neighborhood U of p such that the covariant 
derivatives V FE; all vanish at p. (Exercise: Prove this.) Denote the deriviative of a 
function f at p in the direction E;(p) by 0; f. Then 


0 = >> ((Vz, R)(E;, Ex) Ex, Ej) 
Jk 
+ Y(Ve, RYEx, EEe, Ey) + (Ve, RE; Ey) Ex. E)) 
ve dk 


jk 
= 9; )\(R(Ej, Ex) Ex, Ej) 
ik 


+500; S\ (REx. E;) Ex, E;) + Y- de Y°(R(E;, Ej) Ex. Ej) 
7. k k 


J 


0;8 —)° a;Ric(E;, Ej) — ) > d,Ric(E;, Ex) 
j k 


Here the the first equality follows from the second Bianchi identity (6.3.7), the 
second holds at p because V E£; (p) = 0, the third follows from the definitions of Ric 
and S, and the last uses the identity Ric(£;, E;) = 6;; S/m, which holds by (6.7.14) 
and (6.7.15). Since m > 3 it follows that the derivative of S vanishes everywhere, 
and this proves Lemma 6.7.7. Oo 


Examples of Einstein metrics include all constant sectional curvature metrics by 
Theorem 6.4.8, the Fubini-Study metric on complex projective space, and all met- 
rics with vanishing Ricci tensor. Examples of the latter are Calabi-Yau metrics on 
complex manifolds and G-structures on 7-manifolds. These are again subjects that 
go far beyond the scope of this book. In general, the construction of Einstein met- 
rics and the question of their existence is a highly nontrivial problem in differential 
geonetry. The study of this problem has a long history and there are many deep 
theorems and interesting open questions about this subject. 


6.8 The Weyl Tensor* 


This section introduces the Weyl tensor and explains some of its basic properties. 
The section closes with brief discussions of locally conformally flat metrics and 
self-dual four-manifolds. 
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6.8.1 Definition and Basic Properties 

Let m be a positive integer and let M CR” be an m-manifold. For each el- 
ement p € M denote by R,:7,M x T,M — End(7,M) the Riemann curva- 
ture tensor and by Ric, : 7,M x T,M — R the Ricci tensor of M at p. Also 
let S : M — R be the scalar curvature in Definition 6.7.1. 


Definition 6.8.1 (Weyl tensor) Assume m > 3. The Weyl tensor of M at an 
element p € M is the bilinear map 


W, : T>M x T,M — End(T,M) 
defined by 
(W,(u, v)w, Zz) := (Rp(u, v)w, z) 


1 
- —— (Riep(v. w)(u,z) — Ric,(u, w){v, z}) 


——— 6.8.1 

~ —* (Rip (u,2)(0, 0) — Ric) (v, z)(w, w)) (684) 
S(P) 

+ Gar ayGw 2a (20. w) — (0, 2)(u, 0) 


foru,v,w,z € T,M. 


Lemma 6.7.3 shows that the Weyl tensor vanishes in dimension three. In higher 
dimensions the Wey] tensor may be nonzero. The next lemma summarizes the basic 
algebraic properties of the Weyl tensor. 


Lemma 6.8.2 The Weyl tensor W,:T,M x T,M — End(T,M) at an ele- 
ment p © M is a skew-symmetric bilinear map with values in the space of 
skew-adjoint endomorphisms of T,M and, for all u,v,w,z €T,M and every 


orthonormal basis e€,,...,€m Of T,M, it satisfies 
(W,(u, v)w,z) = (W,(w,z)u, v), (6.8.2) 
W,(u, v)w + W,(v, w)u + W,(w,u)v = 0, (6.8.3) 
Y-(W,(ei,u)v,e) = 0. (6.8.4) 


i=1 


Proof The skew-symmetry of the Weyl tensor and equation (6.8.2) follow directly 
from the definition and Theorem 5.2.14. It then follows from (6.8.2) that W,(u, v) 
is a skew-adjoint endomorphism of 7, M for all u,v € T,M. The verification of the 
Bianchi identity (6.8.3) is a straight forward computation which we leave as an ex- 
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ercise. To prove (6.8.4), letu, v € 7M and choose an orthonormal basis e,..., @m 
of T,M. Then 


m m 


Yo (Wp (e:,u)v,e)) = D2 (Rp(ei,u)v, €;) 


i=1 i=1 
m 


1 
ear 2 (Rico v){e;, e;) — Ric, (e;, v)(u, ¢:)) 


1 m 
-— 2 (Riese e;)(u, v) — Ricy(u, e;)(e;, »)) 


m 


S(p) 
+ a= ono) Di ((ere1te v) — (u, e;)(e;, v)) 
=> Ric, (u, v) 


-1 
-- Ric (w, v) 


1 : 
= —5Riep(e, v) 


This proves (6.8.4) and Lemma 6.8.2. Oo 


6.8.2 The Weyl Tensor in Local Coordinates 


Let 6: U > 2 be a local coordinate chart on an open set U C M with val- 
ues in an open set 2 C R”, denote its inverse by w := @ |: Q > U and 
let Ej (x) := 0;W(x) € TyyyM for x € Q and i = 1,...,m be the be the lo- 
cal frame of the tangent bundle determined by this coordinate chart. Denote 
the coefficients of the first fundamental form by gj; := (£;, Ej), of the Ricci 
tensor by Ric;; := Ric(£;, E;), and of the Riemann curvature tensor by Rig 
Let S = )), , Ricyg’? = D7"), Rig) be the scalar curvature in local coor- 
dinates. Then the cofficients Wie : 22 — R of the Weyl tensor are defined by 
W(E;, E))Ex = dy Wis E-¢. and they can be expresses in the form 


Wijee = (W(Ei, Ej)Ex, Ee) = D0 Wiese 
S 
= \* RY 8ve + ——-—_( geen — 8128; 8. 
: inSve + pg aa (see ik je) (6.8.5) 


1 ; . 
ws (Ricjx gic — Ric gje + Riciegjx — Ricjegit). 
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6.8.3 Conformal Invariance 


Definition 6.8.3 (Locally conformally flat metric) Let M be a Riemannian mani- 
fold. The metric g on M is called locally conformally flat, iff for each p ¢ M 
there exists a Riemannian metric on M that is conformally equivalent to g (see 
Definition 6.7.4) and flat in a neighborhood of p. 


By the local C-A-H Theorem 6.1.17 a Riemannian m-manifold M is locally 
conformally flat if and only if each p € M has an open neighborhood U that is 
conformally diffeomorphic to an open subset {2 C R”, ie. there exists a coor- 
dinate chart ¢ : U — Q and a smooth function A : U — (0,00) such that |v| = 
A(p)|db(p)v|R™ for all p € U andallv € 7,M. 

A remarkable property of the Weyl tensor is that it remains unchanged under 
multiplication of the Riemannian metric by a positive function and so is an invari- 
ant of the conformal class of the metric. This is easy to see when the function is 
constant. In that case the Riemann curvature tensor and the Ricci tensor remain 
unchanged, the scalar curvature gets multiplied by the inverse, and so the Weyl ten- 
sor remains unchanged. As Remark 6.7.5 shows, the situation is more complicated 
when instead of multiplying the metric by a constant, we multiply it by a noncon- 
stant positive function. The conformal invariance of the Weyl tensor can then be 
proved by a somewhat cumbersome calculation in local coordinates. 


Remark 6.8.4 This discussion shows that the Weyl tensor vanishes for every 
Riemannian metric that is locally conformally flat. In fact, it turns out that in di- 
mension m > 4 the Weyl tensor of M vanishes if and only if the Riemannian metric 
on M is locally conformally flat (see [43]). Moreover, a theorem of Kuiper [33, 42] 
asserts that every compact, connected, simply connected Riemannian m-manifold 
that is locally conformally flat is conformally diffeomorphic to the m-sphere with 
its constant sectional curvature metric. Thus every compact, connected, simply con- 
nected Riemannian manifold of dimension m > 4 that is not diffeomorphic to S$” 
must have a nonvanishing Wey] tensor. 


6.8.4 Self-Dual Four-Manifolds 


The lowest dimension in which the study of the Wey] tensor is interesting is m = 4. 
To explain this, it is useful to consider the notion of an oriented manifold. 


Definition 6.8.5 (Orientation) An orientation of an m-manifold M is a col- 
lection of orientations of the tangent spaces 7,M, one for each p € M, that 
depend continuously on p, i.e. if E,,..., E are pointwise linearly independent 
vector fields in a connected open neighborhood U Cc M of p and the vec- 
tors E,(p),...,£m(p) form a positive basis of T,M, then for every g € U the 
vectors £\(q),..., £m(g) form a positive basis of 7, M. In the intrinsic language 
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an oriented manifold is one equipped with an atlas such that all the transition maps 
are orientation preserving diffeomorphisms. 


Definition 6.8.6 (2-Form) Let M be a smooth manifold. A 2-form on M is 
a collection of skew-symmetric bilinear maps w, : T7,M x T,M — R, one for 
each p € M, which is smooth in the sense that for every pair of smooth vector 
fields X,Y on M the assignment p +> w,(X(p), Y(p)) defines a smooth function 
on M. The space of all 2-forms on M is denoted by 27(M). 


In the similar vein the Weyl tensor of a Riemannian m-manifold M can 
be thought of as 2-form with values in the endomorphism bundle End(7M). 
By the symmetry properties in Lemma 6.8.2 the Weyl tensor induces a linear 
map W : 27(M) — 2?(M) via the formula 


(Wo),(u.v):= > (W,(u, v)e;,e;)@p(e;.e;) (6.8.6) 
l<i<j<m 
for w € 27(M), p € M,andu,v € T,M, where e,...,@m is an orthonormal ba- 


sis of T, M. The right hand side of equation (6.8.6) is independent of the choice of 
this orthonormal basis and is a 2-form by Lemma 6.8.2. 

Now let M be an oriented Riemannian 4-manifold. Then a 2-form @ on M is 
called self-dual iff it satisfies the condition 


Wp (0, €1) = Wp (2, €3) (6.8.7) 


for every p € M and every positive orthonormal basis eo, €;, €2, €3 of T,M. It is 
called anti-self-dual iff it satisfies (6.8.7) for every p € M and every negative 
orthonormal basis 0, €1,€2,e3 of 7,M. Thus q is anti-self-dual if and only if 
it is self-dual for the opposite orientation. Denote the space of self-dual 2-forms 
by 27+(M) and the space of anti-self-dual 2-forms by 27 (M). Then there is a 
direct sum decomposition 


2°(M) = 27+ (M) @ Q2?-(M). 


Lemma 6.8.7 Let M be an oriented Riemannian 4-manifold. Then the linear op- 
erator W : 2?(M)— 2?(M) in (6.8.6) preserves the subspace of self-dual 2-forms 
and the subspace of anti-self-dual 2-forms. 


Proof Fix any orthonormal basis éo, 1, €2, e3 of 7, M and abbreviate 
Wijkl >= (W, (e, €; ex, €e) 
fori, j,k, € € {0,1, 2,3}. Then by Lemma 6.8.2 we have 


Wijke = —Wjike = —Wijek = Wkeij. Wijke + Wjie + Weije =9, (6.8.8) 
Woijo + Wil + Waij2 + W3ij3 = 0 (6.8.9) 
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for alli, 7, k, £. It follows from (6.8.8) and (6.8.9) that 


Wo102 = W331; Wo0103 = W312; Wo0203 = W3112; (6.8.10) 


W2302 = W0131; W2303 = Wo112; W3103 = Wo212- 


Namely, by (6.8.8) each of these six identities is equivalent to an equation of the 
form aa wiki = O with 7 ¥ k and this holds by (6.8.9). Use (6.8.8) and (6.8.9) 
again to obtain 


Wo101 — W2323 — Wo202 + W3131 
= W220 + W1221 + W3223 — Wo110 — W112 — W3113 
=0 


and hence, by cyclic permutation of 1, 2, 3, 
€ = Wo101 — W2323 = Wo202 — W3131 = Wo303 — W1212- 
Since Wo101 + Wo202 + Wo303 = 0 by (6.8.9), this implies 
3€ = W221 + W332 + W1331 = W1221 — W330 = W303 — W1212 = €. 
Thus e = 0 and so 
Wo10l = W2323, W0202 = W3131; W0303 = W1212- (6.8.11) 


Now assume that t is a nonzero self-dual 2-form and fix an element p € M. Then 
there exists a positive orthonormal basis eo, e),é@2,e3 of 7,M and a real num- 
ber A ¥ 0 such that t(eo, e1) = t(e2, e3) = A and t(e;, e;) = 0 for all other pairs 
i,j. Hence (Wr)(e;,e;) = Aworj + Aw23;; for all i and j. Take A = 1 and use 
the equations (6.8.8), (6.8.10), and (6.8.11) to obtain 


(Wr)(e0, 1) = Wo101 + Wa301 = Wo123 + W323 = (WT) (Eo, €3), 
(Wr)(eo, €2) = Wo102 + W2302 = Woi31 + W331 = (WT)(e3, e1), 
(Wr)(e0, €3) = Wo103 + W2303 = Wo112 + W312 = (Wr)(e1, e2). 


Thus Wr is self-dual. The anti-self-dual case follows by reversing the orientation. 
This proves Lemma 6.8.7. Oo 


Lemma 6.8.8 Let M be an oriented Riemannian 4-manifold and denote by W : 
2?(M) — 2?(M) the linear operator determined by the Weyl tensor via (6.8.6). 
Then the following are equivalent. 


(i) Ift € Q2?(M) is anti-self-dual, then Wt = 0. 
(ii) The Weyl tensor W satisfies satisfies the equation 


W, (C0, €1) = Wy (ea, e3) (6.8.12) 


for every p € M and every positive orthonormal basis eo, €), €2, €3 of T,M. 
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Proof We prove that (i) implies (ii). Thus assume (i) and choose a positive or- 
thonormal basis éo, €1, €2, €3 of TM. Let t be the 2-form defined by 


T(€, €1) = 1, T(€2, €3) = —-l 


and t(e;, e;) :=0 for all other pairs 7, 7. Then t is anti-self-dual and hence Wr=0 
by (i). Thus it follows from (6.8.2) and (6.8.6) that 


(W(eo, e1)u — W(eo, e3)u, v) = (Wr)(u,v) = 0 
for all u,v € T,M and this proves (ii). 


Conversely, suppose that (11) holds, choose a positive orthonormal basis 
0, €1, €2, €3 Of TM, and let t be an anti-self-dual 2-form. Then 


Ay := T(€0, €1) = —T(€2, €3), 
Ag := To, €2) = —7(e3, €1), 
A3 = T(€o, 3) = —T(e1, €2) 
and hence 
Wr = A, ((W(e, e1) — Wea, €3))-.*) 
+ A2((W(eo, €2) — Wes, €1))-.*) 
+ A3((W(eo, €3) — W(e1, e2))-,*) = 0 
by (ii). This proves Lemma 6.8.8. O 


Definition 6.8.9 An oriented Riemannian 4-manifold is called self-dual iff its 
Weyl tensor satisfies the equivalent conditions of Lemma 6.8.8. It is called anti- 
self-dual iff it is self-dual for the opposite orientation. 


Examples of self-dual 4-manifolds are all 4-manifolds with constant sectional 
curvature by Theorem 6.4.8, or more generally all locally conformally flat 4- 
manifolds such as S$! x $?. Other examples are the complex projective plane with 
its Fubini-Study metric (Examples 2.8.5 and 3.7.5), and Ricci flat Kahler surfaces 
with the opposite of the complex orientation (the K3-surface and the Enriques 
surface). Compact simply connected smooth 4-manifolds with signature zero that 
are not diffeomorphic to the 4-sphere, such as S? x S? or the one-point blowup 
of the projective plane, do not admit any self-dual metrics by Kuiper’s theorem 
(Remark 6.8.4), because every self-dual metric on such a manifold is locally con- 
formally flat. For a survey of these basic examples see Kalafat [34]. 

The study of self-dual 4-manifolds was initiated by Penrose [56] and Atiyah— 
Hitchin—Singer [4] and was later taken up by many authors including Taubes [73], 
LeBrun [45], and Donaldson [18]. 

In [18] Donaldson proposes to study the moduli space of self-dual metrics on 
a compact oriented smooth 4-manifold (without boundary) modulo the action of 
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the group of diffeomorphisms. This is a very difficult problem. The self-duality 
equation is a system of nonlinear partial differential equations and they do give rise 
to a “finite-dimensional moduli space”. However, this space may be highly singular 
and it is noncompact unless it is empty. It would need to be “compactified” in a 
suitable way, one would need to find a way to understand the singularities, and 
one would have to assign to it some kind of “virtual fundamental class” to obtain 
numerical invariants by pairing this class with suitable cohomology classes in the 
ambient space. 


Check for 
updates 


Topics in Geometry 


This chapter explores various topics in differential geometry that are central to the 
subject and accessible with the material covered in this book, but go beyond the 
scope of a one semester lecture course. The first section builds on the material in 
Chap. 4 about geodesics and can be read directly after that chapter. It introduces 
conjugate points on geodesics and proves the Morse Index Theorem. This result 
is then used to show that every geodesic without conjugate points minimizes the 
length locally (and strictly) in the space of all curves joining the same endpoints 
and, conversely, that locally minimizing geodesics have no interior conjugate points 
(Sect. 7.1). This result in turn plays an essential role in the proof of the Continu- 
ity Theorem for the injectivity radius (Sect. 7.2). The next section examines the 
group of isometries of a connected Riemannian manifold and contains a proof of 
the Myers—Steenrod Theorem, which asserts that the isometry group always admits 
the natural structure of a finite-dimensional Lie group (Sect. 7.3). The proof given 
here has several parallels to the proof of the Closed Subgroup Theorem. This sec- 
tion is based on the study of isometries and the Riemann curvature tensor in Chap. 5 
and can be read directly after that chapter. The present chapter then deals with the 
specific example of the isometry group of a compact connected Lie group equipped 
with a bi-invariant Riemannian metric (Sect. 7.4). The last two sections are de- 
voted to Donaldson’s differential geometric approach to Lie algebra theory [17]. 
They build on the material in Chap. 6 and include Donaldson’s existence theorems 
for critical points of convex functions on Hadamard manifolds (Sect. 7.5) and his 
existence proof for symmetric inner products on simple Lie algebras (Sect. 7.6). 
Corollaries include the uniqueness of maximal compact subgroups, and Cartan’s 
theorem about the compact real form of a semisimple complex Lie algebra. 


7.1 Conjugate Points and the Morse Index* 


This section introduces conjugate points on geodesics and contains a proof of the 
Morse Index Theorem. As an application we prove that every geodesic without 
conjugate points minimizes the length and the energy locally (and strictly) among 
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all nearby curves joining the same endpoints. Conversely, locally minimizing 
geodesics have no interior conjugate points. The results of this section will be used 
in Sect. 7.2 to prove the Continuity Theorem for the injectivity radius. Assume 
throughout that M C R” is a smooth m-manifold and recall the notation 2p 4 
for the space of all smooth paths y : [0,1] > M with the endpoints y(0) = p 
and y(1) = q (Sect. 4.1.2). 


7.1.1 Conjugate Points 


We have seen in Theorem 4.4.4 and Corollary 4.4.6 that geodesics minimize the 
length on short time intervals. A natural question arising from this observation 
is how long the time interval can be chosen on which a geodesic minimizes 
the length at least locally in some neighborhood in the space of paths. An 
answer to this question is closely related to the Hessian of the energy func- 
tional E : 2,4 > R in (4.1.2). It was established in Lemma 6.6.3 that the 
Hessian of the energy functional EF at a geodesic y : [0,1] > M is the linear 
operator H,, : Vecto(y) — Vect(y) defined by 


H,X := -VVX — R(X, py. (7.1.1) 


The domain of this operator is the space Vecto(y) of all vector fields X along y that 
vanish at the endoints, i.e. X¥(0) = 0 and X(1) = 0. Recall that a Jacobi field along 
a geodesic is a solution of the equation 


VVX + R(X, y)p =0, (7.1.2) 


and so the kernel of H,, is the space of Jacobi fields along y that vanish at the 
endpoints (Lemma 6.1.18). 


Definition 7.1.1 Let y : [a,b] > M be a geodesic. A conjugate point is a real 
number Tt in the interval a < t < b such that there exists a nonvanishing Jacobi 
field X along the restriction y|{q,7] that satisfies X(a) = 0 and X(t) = 0. The di- 
mension of the space of solutions of this equation is called the multiplicity of the 
conjugate point t and will be denoted by 


VVX + R(X,y)) = 0, 


X(a) = 0, X(t) = 0 ne 


m,(t) := dim) X € Vect(y|ja,7) | 


Thus m,(t) = 0 whenever t is not a conjugate point of y. 


In Sect. 4.4 we have addressed the question when a geodesic y : [0, 1] — M with 
the endpoints y(0) = p and y(1) = g minimizes the lengths of curves with the same 
endpoints globally, i.e. when it satisfies L(y) = d(p,q). A weaker variant of this 
question is whether it minimizes the length locally, i.e. only among nearby curves 
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in 27,4. Here the word “nearby” can have several meanings, depending on which 
topology on the space S2,, is used. The relevant topologies in the present setting are 
the C! topology and the C® topology. Since our manifold M is embedded in R” 
these topologies are induced by the distance functions defined by 


doy) = sup YO-y'Ol+ sup 7O- HOM 
Sts <t< 
. Ly(t)-y! (7.1.4) 
<t< % t t 
dcx(y,y') = pie suPy<r<i |Ge (V(t) — v'())I 


k 
tol + Supper SV) — v'()| 


for y, y’ € 92,4. Note that, if y € 92,4 satisfies min, |y(¢)| > 0, then so does every 
curve y’ in a sufficiently small C '-neighborhood of y in 2,4. 


Theorem 7.1.2 Let y : [0,1] — M be a nonconstant geodesic with the endpoints 
y(0) = pand y(1) = q. Then the following holds. 


(i) Ifthere exists a C®-open neighborhood U C Qy 4 of y such that every curve y' € 
U satisfies L(y') => L(y), then y has no conjugate points t in the open inter- 
val0<t <1. 

(ii) If y has no conjugate points t in the interval 0 < t <1, then there ex- 
ists a C!-open neighborhood U C Qyq of y such that every curve y' € U 
satisfies L(y') > L(y) with equality if and only if there exists a diffeomor- 
phism p : [0.1] — [0, 1] such that p(0) = 0, p(.) = 1, and y' = yop. 


The proof will be based on the Morse Index Theorem explained below. 


Example 7.1.3. The archetypal example of a conjugate point is the endpoint of 
a geodesic y : [0,1] > S? on the unit sphere S? C R? of length z. This is the 
endpoint of an arc around half of a great circle, and at this point the geodesic seizes 
to be locally unique as there is a continuous family of geodesics joining north and 
south pole (the meridians). 


Example 7.1.4 The absence of conjugate points does not signify that a geodesic 
y € Qyq minimizes the length globally. The sinplest example is a geodesic around 
the unit circle M = S! of length L(y) > x. Similar examples exist on the torus 
M = T”, on the cylinder M = R x S!, and also on simply connected manifolds 
such as the sphere S* with a suitably chosen Riemannian metric (so that an open 
subset U C S? is isometric to a cylinder (0, 1) x S', for example). 


7.1.22 The Morse Index Theorem 


Definition 7.1.5 (Morse index) Let y : [0,1] ~ M be a geodesic. The Morse 
index of y is defined as the maximal dimension of a linear subspace X C Vecto(y), 
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such that 


1 


f(vxr- (R(X, 7, X)) dt <0 (7.1.5) 
0 


for every nonzero element X € X. It will be denoted by 


(7.1.6) 


uy) = nai) X is a linear subspace of Vecto(y) | 


and (7.1.5) holds for all X¥ € X \ {O}(" 


This is also the number of negative eigenvalues, counted with multiplicities, of the 
operator H, in (7.1.1). 


Theorem 7.1.6 (Morse Index Theorem) Let y : [0,1] > M be a geodesic. Then 
y has only finitely many conjugate points t, each with multiplicity 1 < m,(t) < m, 
and the Morse index of y is the number of conjugate points in the open interval 
0 <t < 1, counted with multiplicities, i.e. 


wy) = > m,(2). Ci) 


O<r<l 


Exercise 7.1.7 Prove that geodesics on manifolds with nonpositive sectional cur- 
vature have no conjugate points. So their Morse indices are zero. 


Exercise 7.1.8 Find geodesics on the unit sphere M = S? C R? with arbitrarily 
large Morse index. 


The proof of Theorem 7.1.6 requires some background in functional analysis 
which goes beyond the scope of this book and for which the interested reader is 
referred to [12]. Remark 7.1.9 below lists the main properties of the operator H, 
that are used in the proofs of Theorems 7.1.2 and 7.1.6. It will be convenient in 
some places to use the L? inner product 


1 


fi X(t), Y() d 


0 


(X,Y)72: 


and the associated norm 


1 


fixora 


0 


|X Ilz2 : 


for vector fields X, Y along y. 
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Remark 7.1.9 (Properties of Hf) We begin with the properties that can be easily 
verified directly, without an appeal to Hilbert space theory. 


(a) Each eigenvalue of Hy is a real number. 
This assertion is a consequence of the fact that the operator H, is symmet- 
ric, i.e. (HX, Y)72 = (X, H,Y)/2 for all X,Y € Vecto(y). (Exercise: Verify 
assertion (a) by complexifying the space of vector fields along y.) 

(b) Each eigenvalue of Hy has multiplicity at most m. 
The eigenspace of H, for an eigenvalue A consists of solutions of the equa- 
tion -VVX — R(X, y) = AX with X(0) = 0. Every solution is uniquely de- 
termined by V X(0) and is an eigenfunction for A if and only if X(1) = 0. 

(c) Every X € Vecto(y) satisfies the Poincaré inequality 


1 1 
1 
fixor < a f WxPar, (7.1.8) 
a 
0 0 
To see this, choose a parallel orthonormal frame £)(t),..., Em(t) € TywM 


of the tangent bundle along y and write X(t) = )°7"_, & (¢) E(t). Then the in- 
equality (7.1.8) takes the form 2? ic |E\? dt < fo ||? dt and this can be proved 
by writing & as a Fourier series &(t) = °°, sin(wkt)&. (Exercise: Prove the 
estimate (7.1.8). Verify that it is sharp.) 

(d) Let cy := sup, sup),j=1(R(v, Y(t) 7), v). Then, for all X € Vecto(y), 


1 


1 
[(ivxP (Ree. 7.)) dt = GP =o) f xP at 
0 


0 


This follows directly from the Poincaré inequality in (c). 
(e) Let A be an eigenvalue of H,. Thend > 1? — cy. 
There exists a nonzero element X € Vecto(y) such that H, X = AX. By (d) this 
implies A||X [72 = (X, HX) 2 = (x? — cy) ||X||j2 and so A > 2? — cy. 
(f) The set o(H,) of eigenvalues of Hy is a discrete subset of R. 
Let A € o(H,) and let I7, be the orthogonal projection onto the eigenspace 
of A. Then the operator H, — Aid + IT, : Vecto(y) — Vect(y) is bijective, and 
hence so is the operator H, — A’id + IT, for A’ sufficently close to A. Such a 
number A’ cannot be an eigenvalue of H,,. This argument uses the fact that 11, 
is a Fredholm operator between appropriate Sobolev completions. 
The smallest eigenvalue of H, is the supremum of all real numbers a that satisfy 
the inequality 


Ya 


(g 


1 
i (IvxP - (R(X, )y, x)) dt 20 [X(t)|? dt (7.1.9) 
0 


for all X € Vecto(y). 
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To prove this, define a € R to be the infimum of the integrals on the left 
in (7.1.9) over all X € Vecto(y) with || X ||,2 = 1. This infimum is finite by (d). 
Now choose a minimizing sequence X; € Vecto(y) with ||X;||,2 = 1. This se- 
quence satisfies a uniform upper bound on || VX; ||;2. Hence, by the theorems of 
Banach—Alaoglu and Arzela-Ascoli, there exists a subsequence that converges 
weakly in the Sobolev space W! and strongly in the supremum norm to a 
weak solution of the equation -VVX — R(X,y)y = aX with zero boundary 
condition. Now one can verify by a bootstrapping argument that every weak 
solution is smooth. Hence (7.1.9) holds, a is an eigenvalue of Bis, and every 
eigenvalue A of H, satisfies A > a. 

The Morse index of y is finite. 

The operator H,, has only finitely many negative eigenvalues by (e) and (f), and 
the direct sum X,, of their eigenspaces is finite-dimensional by (b). Moreover, 
every nonzero element X € X,, satisfies the inequality (X,H,X))2 < 0. One 
can then repeat the argument sketched in (g) for the L* orthogonal complement 
of X, to show that (X,H,X)7,2 > 0 for all X € ko Hence the dimension 
of X,, is the Morse index of y (Definition 7.1.5). 


(h 


wm 


Proof of Theorem 7.1.6 Choose a parallel orthonormal frame 
E\(t),...,Em(t) € T,~)M 
and, for 0 < ft < 1, define the symmetric matrix S(t) = S(t)’ € R”*” by 
S(t) = (SiO) i=1> Si) = (REVO), YO) O), £0), 
Let J := [0, 1] and define the operator A : C[°U/, R”) — C™(U.R”) with the do- 
main Cy° (1, R”) := {§ € CU, R”) | (0) = &(1) = Oj, by 
A€ := —& — SE. 


This operator is isomorphic to the operator (7.1.1) via the isomorphism that sends 
—EeEC°U,R”) to X := )0; & E; € Vecto(y). Now define a family of opera- 
tors A; : CPU, R”) > C°U.R"”) by 


1. 
(A,)(t) := —2b) = S(tH)5(@) 


for 0 < t < 1. Then A, is isomorphic to the operator (7.1.1) on the interval [0, t] 
via the isomorphism that sends € € Cy°(U/,R”) to the vector field X(t) := 
(cH) E (0) € TyayM, 0 <t < t, along the curve y|jo,7}. Thus t is a con- 
jugate point of y if and only if A; has a nontrivial kernel, and the dimension of the 
kernel is the multiplicity m,(t). 

By Remark 7.1.9 A, has only positive eigenvalues for small positive t. For 0 < 
t < 1 define the operator A, : Cyr Ud, R”) = CU, R”) by 


(A810) = (4.80) = SEH - 18 EDEO, 
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If t is a conjugate point, we claim that every element & € ker(A,) satisfies 


1 


r.@):= / (§, Ag) dt = SIP. (7.1.10) 


0 


To see this, note that E(t) +1?S(tt)é(t) = Oand 4 4 S(tt) = tS(tt). Hence 


1 


re = [ (5.8) - 20. c8@nE@)) at 
0 
1 


= | (S60.2O) + 20, Seng) + EO. SeeNEO)) d 


0 


ny 


(6.80) — 4 G.)) at 


ve 


(EI + 20), E@)) at 


< (igor?) a 


| 
| 
we 
ar. 
rr 
— 
— 
™., 


This proves (7.1.10). Note also that I°,(€) < 0 unless &(t) = 0 for all ¢. 
With this understood, we appeal to the Kato Selection Theorem [35, Theo- 
rem II.5.4 and Theorem II.6.8]. It asserts in the case at hand that, near each point T 
with k := dim(ker(A,,)) > 0, there exist k continuously differentiable functions 
Aj: (t —€&, T) +£) > R such that A;(t)) = 0, the numbers A, (T),..., A, (7) are the 
eigenvalues of A; near zero, with multiplicities accounted for by repetitions, and 
the derivatives A; (to) are the eigenvalues of the crossing form I, : ker A,, > R 
in (7.1.10), again with multiplicities accounted for by repetitions. The derivatives 
are all negative by (7.1.10). Hence there exists a 6 > 0 such that A;(t) > 0 for 
™T —6 <t <1 and A;(t) < 0 for t]) < t < 1) + 6. Hence conjugate points are 
isolated, and for t9 < tT < t) +6 the operator A, has precisely k more negative 
eigenvalues than for t) — 6 < t < to. Hence the number of the negative eigenval- 
ues of A; with multiplicities is )°)_,_~, dim(ker(A,)). This proves Theorem 7.1.6. 
oO 


For more detailed explanations and other closely related applications of the Kato 
Selection Theorem the reader is referred to [61, 76]. 
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7.1.3 Locally Minimal Geodesics 
Proof of Theorem 7.1.2 We prove part (i). Let X € Vecto(M) and define 


Ys(t) 2= exp, (sX(t)) 


for 0 < t < 1 and —6 < s <6. Here 6 > 0 is chosen so small that sX(t) is con- 
tained in the domain V,(;, C T,()M of the exponential map for all ¢ and all s 
with |s| < 6. Then lim, .9 dc~(y, y;) = 0 and so y, € U for s sufficiently small. 
Hence the function (—é,5) > R: s+» L(y,) has a local minimum at s = 0, and 
since E(y) = L(y)*/2 and L(y,)?/2 < E(y;), so does the function s +> E(y,). By 
Lemma 6.6.3 this implies 


1 


0< E(y,) = / (ivx/? — (R(X, py. X)) de. 
0 


d2 
ds? s=0 


This inequality shows that the operator H,, has no negative eigenvalues. Hence, by 
Theorem 7.1.6 the geodesic y has no conjugate points t in the interval 0 < t < 1. 
This proves (1). 

We prove part (ii) in six steps. Let y € (2), be a nonconstant geodesic without 
conjugate points. 


Step 1 There exist constants 6, > 0 and ¢ > 0 such that 


1 1 


[(vxr- (R(X, yy’, x)) arze|( (IVX(@P + + |X(/?) dt Gta) 


0 0 


for every curve y' € Qyq with dci(y, y') < 6, and every X € Vecty(y’). 


By Theorem 7.1.6 the Hessian H,, has no negative eigenvalues and, since | is not 
a conjugate point of y, also zero is not an eigenvalue of H,,. Hence the smallest 
eigenvalue of H,, is positive and so, by part (e) of Remark 7.1.9, the estimate (7.1.9) 
holds for all X € Vecto(y) with a positive constant a. If the constant b > 0 is chosen 
such that (R(v, y(t))7(t), v) < blv/? for all ¢ and all v € T,)M, we obtain the es- 
timate (7.1.11) for y’ = y and X € Vecto(y) with ¢ := a/(a + b + 1). (Exercise: 
Verify this.) In a local coordinat chart on M the integrand on the left in (7.1.11) has 
the form 


> (H + Pe Tkge!) axe(é + ee.) = > Rijneki c! CE, 


ke ij bv ij ke 


where c(t) := o(y'(t)) and E(t) = dd(y’(t)) X(t) (and the coefficients g;;, i 
and Rjjx¢ are functions of c(t)). This expression depends continuously on the 
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curve c and its derivative ¢. Hence there exists a constant 6; > 0 such that ev- 
ery curve y’ € 2), with dci(y, y’) < 6; and every X € Vecto(y’) satisfies the 
estimate (7.1.11) with e = a/2(a + b + 1). This proves Step 1. 


Step 2 Let 6; > 0 be as in Step 1. Then there exists a constant 52 > 0 with the 
following significance. If Y € Vecto(y) satisfies 


sup (IX) es IVY()|) 2b, (7.1.12) 
0<t<1 

then the curve y' := exp,(Y) € S2p.q satisfies dci(y, y') < 61. 

For 0 < ft < 1 define the map f; : V,a) x Tyw)M — TM by 


fi(0.0) = (exp). v), d exp), (HO.T + hy (FO,0))) 71.13) 


for v € Vi) and V € T,)M. By Lemma 4.3.6 the map (¢,v,U) + f;(v,¥) from 
the set U := {(t, v,v)|[O<t<lveEViw,vE TM} to TM is smooth and it 
satisfies f,(0,0) = (y(t), y(t)). Thus there exist constants r; > 0 and C; > 1 such 
that all ¢ and all v,¥ € T,q)M with |v| + |v] <r) satisfy v € Vy) and 


IP-VOL+(P-POLS C(I +l), (PD) = A.D). 7.1.14) 


Choose Y € Vecto(y) such that |Y(t)| + |VY()| <7: for 0 <t <1 and de- 
fine y'(t) := exp,,)(Y(¢)). Then it follows from the Gau8—Weingarten formula 
in Lemma 3.2.3 that (y‘(t), y’(t)) = fi(Y(t), VY(0)) for all t. Hence the esti- 
mate (7.1.14) shows that Step 2 holds with 62 := min{r;, 6,;/C;}. 


Step 3 Let 53 > 0 be as in Step 2. Then there exists a constant 53 > 0 with the fol- 
lowing significance. If y' € Qpy4 satisfies the inequality dci(y, y') < 53, then there 
exists a unique vector field Y € Vecto(y) satisfying (7.1.12) and 


IYO <infyO:M), (= exp) (VO) (7.1.15) 
for0O<t<1. 
Define p := info<;<1 inj(y(t); M) > 0 and let /; be the map in (7.1.13). Its derivative 
at the origin is given by df,(0,0)(n, 7) = (7.7 + Ay) (V(t). n)) and so is invertible. 


Hence, by the implicit function theorem, there exist constants r2 > 0 and C2 > 1 
such that, for 0 < ¢ < 1, the set 


O,:= {(p.P) € TM ||p—yO|+[P-YO| <r} 


is compact, f; restricts to a diffeomorphism from a compact neighborhood of the 
origin in B,(y(t)) x Ty)M to Q;, and every pair (p, P) € Q; satisfies 


P+ <sQ(lP-vOl+B-VOl), = F.D. 7.1.16) 
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Choose y’ € §2)4 such that dci(y, y’) < ro. Then (y’(t), y/(t)) € Q, for all ¢ 
and hence there exists a unique vector field Y € Vecto(y) satisfying (7.1.15). 
By (7.1.13) it also satisfies (Y(t), VY(1)) = f,-'(y'(), y’(0)) for all t. Hence the 
estimate (7.1.16) shows that Step 3 holds with 63 := min{rp, 6./C2}. 


Step 4 Let 53 be as in Step 3. Then there exists a constant 54 > 0 with the following 
significance. If y' € 2p¢q satisfies dci(y, y') < 54, then the map p : [0,1] — [0, 1] 
defined by p(t) := L(y’)! Pi ly'(s)|ds for 0<t <1 is a diffeomorphism and 
daly, yop!) < $3. 


Define c := L(y) and C := sup, |7(t)|. Then c > 0 because y is nonconstant. We 
claim that Step 4 holds for every constant 64 > 0 that satisfies 


12 
ee _ (32+ ae Bie (7.1.17) 


To see this, fix a constant 64 > 0 that satisfies (7.1.17) and let y’ € {2,4 such 
that dci(y, y’) < 64. Since the geodesic y is parametrized by the arclength, it satis- 
fies |y(t)| = L(y) = c. Hence ||y’(t)| — c| < 64 for all t. Since 54 < ¢ by (7.1.17), 
this implies that p is a diffeomorphism. It also implies the length inequality 
|L(y’) —c| < 54 and hence 


ec Sc pry = POI SE 3 aye SE 
L(y’) c-94 


<< 
3 c+ 64 


C= 64 Cc 
Define 8 := p-! and y” := y’o B. Then B(p(t)) = 1//(t) and hence 
_ |= AOL. 1264 


8@@) - 11 = SA <= 
This implies 
; 126 126 ; 
BO-<—-, O-1<—, bO<?7. 
Hence 


lv) —yv"(@)| < |lyv@ — y(BO)| + lyBO) - vy’ (BbO)| 
<celt-—BMO|+dayy’) 
< 1354 
and 
VO —- Pr" Ol < VO — ¥BO)| + 11 - BOW BO)!| 
+ B|Y(BO) —y'(BO)| 
< Clt— BW)| +cl1-—BO| + BOdci(y, y) 


12C 
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These inequalities imply dci(y, y”) < (12C/c + 32)d4 < 63 by (7.1.17) and this 
proves Step 4. 


Step 5 Let 53 > 0 be as in Step 3 and let y' € Qyq such that dci(y, y') < $3. 
Then E(y') => E(y) with equality if and only if y' = y. 


Assume y’ # y. By Step 3 there exists a nonzero vector field Y € Vecto(y) satisfy- 
ing (7.1.12), (7.1.15), and y’ = exp, (Y). For 0 < s,t < 1 define 
Ys(t) = Expy (S¥(2)). 
Then yp = y, y; = y’, and Step 2 asserts that 
dcil¥ Vs) < 44 forO<s <1. 


Hence it follows from the argument in the proof of Lemma 6.6.3 that 
1 
72 Es) = (Iv, IsYs| = (R(OsYs; O+Ys)O; Vs> sys)) dt >0 
0 
for all s. Here the equality uses the identity V,d,y; = 0 and the inequality follows 
from Step | and the fact that d,y, = d exp,(sY)Y is a nonzero vector field along ys. 
Thus the curve [0, 1] > R: st E(y,) is strictly convex. Since its derivative vanishes 


at s = 0, it follows that E(y,) > E(y) for 0 <s <1. Since y; = y’, this proves 
Step 5. 


Step 6 Let 54 be as in Step 4. Then the C!-open set 
U:= {y' € Qyq | dcilyy’) < 64} 
satisfies the requirements of part (ii) in Theorem 7.1.2. 
Let y’ € U and define the diffeomorphism p : [0, 1] — [0, 1] as in Step 4. Then, by 
Step 4 we have dci(y, y’0 p~!) < 83 and hence, by Step 5 this implies E(y’o p~!) > 


E(y) with equality if and only if y’ o p~! = y. Since y and y’op~! are parametrized 
by the arclength, we find that 


LY’) = LY’ 0p) = V2E(y'0p-}) = V2EQ) = L(y), 


and that equality holds if and only if y’ o p~! = y. Thus every curve y’ € U that 
arises from y by reparametrization has the same length as y and every other curve 
in U is strictly longer. This proves Step 6 and Theorem 7.1.2. Ey] 


We remark that there is a precise analogy between Theorem 7.1.2 about 
geodesics (extrema of the energy functional £) and extrema of a function 


f:R">R 
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on a finite-dimensional vector space. If the function f has a local minimum at a 
point xo € R”, then the Hessian of f at xo has no negative eigenvalues and, con- 
versely, if xo is a critical point of f and all the eigenvalues of the Hessian at xo are 
positive, then Xo is a strict local minimum of /. 


7.2. The Injectivity Radius* 


Assume throughout that M C R” is a smooth m-manifold. In this section we 
prove that the function which assigns to each point p € M its injectivity ra- 
dius inj(p; M) € (0, co] is continuous. Recall the concept of a local diffeomor- 
phism as a smooth map between manifolds of the same dimension whose derivative 
at each point is a vector space isomorphism (Definition 2.2.20). Recall also the 
notation V, C 7,M for the domain of the exponential map at p and the nota- 
tion B,(p) = {v €T,M ||v| < r} for p € M andr > 0. 


Theorem 7.2.1 The function M — (0, co]: p > inj(p; M) is continuous. 
The proof of Theorem 7.2.1 is based on two lemmas. 


Lemma 7.2.2 The set U, := {p € M |r < inj(p; M)} is open for each real num- 
berr > 0. 


Proof Let r > 0 and po € U,. We prove in three steps that there exists a 6 > 0 
such that r + 6 < inj(p; M) for every p € M with d(p, po) < 6. 


Step 1 There exists a 5>0 such that B,.3(p) C V, for every p <M with 
d(p, po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then there exist se- 
quences p; € M and v; € T,,M \ V; such that d(p;, po) < 1/i and |v;| <r + 1/i. 
Passing to a subsequence, if necessary, we may assume that v; converges to a 
vector vg € T,,M. Then |vo| <r and so (po, vo) € V. Since (p;,v;) € TM \ V 
converges to (Po, Vo), this contradicts the fact that V is open. This proves Step 1. 


Step 2 There exists a 5 > 0 such that the map exp, : B,+3(p) > M is a local 
diffeomorphism for every p € M with d(p, po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then there exist 
sequences p; € M and v; € T,,M such that d(p;, po) < 1/i and |v;| <r + 1/i, 
and the derivative d EXP, (v;) : T,,M > Texp,, (v;)M is not injective. Passing to a 
subsequence, if necessary, we may assume that v; converges to a vector Up € T,,M. 
Then, by smoothness of the exponential map, the derivative 


d exp, (vo) : Tp), M > Texpy, (v0) M 
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is not injective. Since |vo| <r, this contradicts the fact that r < inj(po; M). This 
proves Step 2. 


Step 3 There exists a 5 > 0 such that the map exp, : B,+3(p) — M is injective for 
every p € M with d(p, po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then there exist 
sequences p; € M and u;,v; € T,,M such that 


d(pi, po) < 1/i, [ae] re fy, lu;i|<r+1/i 
and 
Uj; F Vi, e€XP,, (Ui) = eXp,, (Vi) =: qi- 
Passing to a subsequence, if necessary, we may assume that the limits 
Up = jim u; € T,,M, Uo 1= jim v; € T,,M 
exist. These limits satisfy 
|uo| <1, luo] <7, EXP, (Uo) = EXPp, (Vo). 


Since r < inj(po; M), this implies ug = vo and hence 


lim u; = lim v; = vo. 
i>0oo i—>0oo 
Now define 
Uj uj 
Wj = € T,,M, Tj := |v; —u;| > 0. 


Tj 
Passing to a further subsequence, we may assume that the limit 


wo := lim w; € T,,M 


i>oo 
exists. Then |wo| = 1 and hence 


exp, (U;) — exp, (Uj 
Te Pp; (Ui) — EXP, (Ui) 


i-0oo T; 


Ui — Uj; 


dt 


L 


1 
= lim fa exp,, (ui + t(v; — u;)) 
0 


d exp, (Vo) Wo 


£0. 
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This contradiction proves Step 3. Now choose a constant 6 > 0 that satisfies the re- 
quirements of Step 1, Step 2, and Step 3. Then, for every p € M with d(p, po) < 6 
we have B,+5(p) C V, and the map exp, : B-+s(p) > M is a diffeomorphism onto 
its image. Hence inj(p; M) >r+6>r for every element p € M with d(p, po) <4, 
and this proves Lemma 7.2.2. O 


Lemma 7.2.3 The set A; :={p € M |r < inj(p; M)} is closed for each real num- 
berr > Oand hence also forr = o. 


Proof Let p; € A, be a sequence that converges to an element p € M. We prove 
in five steps that r < inj(p; M). 


Step 1 B,(p) C V,. 


Choose a tangent vector v € T,M with |v| <r, choose a constant ¢ > 0 such that 
|v| + e <r, and choose an integer i such that d(p, p;) < ¢. Define 


K := exp,, (Biyj+e(pi)) = {¢ € M | dq, pi) < |v| +e}. 


Here the second equality follows from Theorem 4.4.4 and the fact that |v| + ¢ < 
r < inj(p;; M). By definition, K is the image of a compact set under a continuous 
map, and so is a compact subset of M/. Hence 


K := {(q,w) € TM | d(q, pi) < |v| +, |w| = |u|} 


is a compact subset of TM. We claim that v € V,. Suppose, by contradiction, 
that this is not the case. Then J,,, M [0,00) = [0, 7) with 0 < T < 1. Denote by 
y : [0,7) > M the geodesic y(t) := exp, (tv). Then 


d(y(t), pi) S d(exp, (tv), p) + d(p, pi) S |tvl +e < |u| +e 


and |y(t)| = |v|, and hence (y(t), y(t)) € K for0<t <T. By Lemma 4.3.3 
and Corollary 2.4.15, this implies that there exists a constant 6 > 0 such that 
[0,7 + 5) C Jp», in contradiction to the definition of 7. This contradiction 
shows that our assumption that v is not an element of V, must have been wrong. 
Thus v € V, and this proves Step 1. 


Step 2 Let q € M such that d(p,q) <r. Then there exists a vector v € T,M such 
that |v| = d(p, q) and exp, (v) = q. 


Since p; converges to p, we have lim;_,.. d(p;,qg) = d(p,q) <r and so there 
exists an integer ig € N such that d(p;,q) <r for all i > ig. Then, for each 
i > ig, there exists a tangent vector v; € 7,,M that satisfies the conditions 
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exp,, (vi) = q and |v;| = d(pi,q) < r (Theorem 4.4.4). Passing to a subsequence, 
if necessary, we may assume that the limit v := lim)... v; exists in R”. Then 
v €T)M, |v| = lim; |v; | = limj.. d(pi, g) = d(p, gq), so v € B,(p) C V, and 
exp, (v) = limj+o0 exp, (vi) = g. This proves Step 2. 


Step 3 Let v € T,M such that |v| < r. Then d(p, exp,(v)) = |v]. 


Define q := exp,(v), so d(p,q) < |v| <r. Choose a sequence v; € T,,M such 
that |v;| = |v| for all 7 and lim;.. vj = v. Then the sequence q; := exp,, (vi) 
converges to q = exp,(v) and satisfies d(p;,q;) = |v;| by Theorem 4.4.4. Hence 
d(p.q) = limj-.00 d(p;..g;) = lim;_,~ |v;| = |v| and this proves Step 3. 


Step 4 The map exp, : B,(p) > M is a local diffeomorphism. 


Let v € B,(p) and let v € T,M be a nonzero vector. Choose any real num- 
ber 1 <A <r/|v| so that |Av| <r and define the geodesic y : [0,A] ~ M and 
the vector field X¥ € Vect(y) by 


y(t) = exp, (tv), X(t) := 7 exp,(t(v + s0)) = d exp, (tv) 
0 


s= 


for0 <t <A. By Lemma 6.1.18, X is a Jacobi field along y and 
X(0) = 0, VX(0) =t 40, X(1) = d exp, (v)v. 


Since L(y) = d(p,exp,(Av)) = d(y(0), y(A)) by Step 3, it follows from part (i) 
of Theorem 7.1.2 that the geodesic y : [0,A] — M has no conjugate points t in 
the open interval 0 < t < A. In particular, tr = 1 is not a conjugate point, and 
so X(1) # 0. Hence the derivative d exp,(v) of the exponential map is bijective 
at every point v € B,(p) and this proves Step 4. 


Step 5 The map exp, : B,(p) > M is injective. 

This is a covering argument. Let vo, v; € B,(p) with exp, (vo) = exp, (v1), choose 
a smooth path v : [0,1] > B,(p) such that v(0) = vo and v(1) = v,, and define 
y(t) := exp, (v(t)) for 0 < ¢ < 1, so y(O) = y(1) = g ‘= exp,(v9). Choose 


p <r andi €N such that |u(t)| < p for all ¢ and d(p;, p) < r — p. Then 
d(p;. y(t) < d(pi. P) + d(p. YO) < Api P) + p <r for all t. Define 


B(s, 1) = expy, (8 exp,'(g) + = s)exp,')), 0S 8,0 <1. 


This map takes values in the set U,(p) = {p’ € M | d(p, p’) < r} and satisfies 


B(0,t) = y@) =exp,(v@)), BU.t)=q for0<it <1, 
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and B(s,0) = B(s, 1) = q forall s. Since the map exp, : B,(p) — U,(p) is surjec- 
tive by Step 2 and a local diffeomorphism by Step 4, a path lifting argument shows 
that there exists a smooth map wu : [0, 1]? > B,(p) such that 


u(0,t) = v(t), exp, (u(s,t)) = B(s,t) 


for 0 <s,t <1. This map satisfies u(s,0) = vp and u(s, 1) = v, for all s and, more- 
over, the curve t +» u(1,7¢) must be constant. Hence vo = v;. This proves Step 5 and 
Lemma 7.2.3. Oo 


Proof of Theorem 7.2.1 The set {p € M |a <inj(p; M) < b} = U, \ A, is open 
for all nunbers 0 < a < b < co by Lemma 7.2.2 and Lemma 7.2.3. Hence the func- 
tion M > (0, 00] : p & inj(p; M) is continuous. Oo 


7.3 The Group of lsometries* 


This section is devoted to the Myers—Steenrod Theorem which asserts that the 
group 7(M) of isometries of a connected smooth Riemannian manifold M admits 
the natural structure of a finite-dimensional Lie group [52]. 


7.3.1 The Myers-Steenrod Theorem 


Assume throughout that M C R” is a nonempty connected smooth m-manifold. To 
state the main result, it is convenient to introduce the space 


P.qd€¢Mand®:T,M >T,M 
is an orthogonal isomorphism : 


G := \(q,@, p) (7.3.1) 


This space is a groupoid, i.e. a category in which every morphism is an iso- 
morphism. The space of objects is the manifold M, the morphisms from p € M 
to g € M are the triples of the form (q, ®, p) € G, the identity morphism from p 
to itself is the triple (p, 1, p), the inverse of (¢, ®, p) € G is the triple (p, ®"',q), 
and the composition of a morphism (q¢,®, p) € G from p to qg with a mor- 
phism (7, %q) € G from q to r is the triple (r, YW, p). 

The space G is a smooth manifold (in the intrinsic sense). To see this, consider 
the diagonal action of the orthogonal group O(m) on the product of the orthonor- 
mal frame bundle O(M) with itself (Definition 3.4.3). This action is free and the 
map z : O(M) x O(M) — G which sends the pair ((qg, e’), (p,e)) to the triple 
(q,e’0e!, p) €G descends to a bijection from the quotient (O(M) x O(M))/O(m) 
to G. By Theorem 2.9.14 there is a unique smooth structure on G such that the map 
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az: O(M) x O(M) = G isa submersion. With this structure the maps s, ft : G > M 
defined by 


5(q,®, p) = p, t(q,®, p):=4¢q 


are smooth, the map e : M — G defined by 


e(p) := (p,1, p) 


is an embedding, the inverse map i : G > G defined by 


i(q, ®, p) = (p, ®',q) 


is a smooth involution, the map s x ft: G x G — M x M is a submersion, and the 
composition map m : (s x t)~'(A) > G defined by 


m((r, YW, q), (q,®, p)) := (7, Y®, p) 


is smooth. (Here A := {(q,q)|q € M} is the diagonal in M x M.) These proper- 
ties assert that the tuple (G, s,t, e, i,m) is a smooth groupoid. 
For each p € M define 


q¢Mand®:T7,M —T,M 


: ; : ; (7.3.2) 
is an orthogonal isomorphism 


Gp i= } ®) 


This space is a submanifold of G because s : G — M is a submersion. The space 
Gp is also a smooth submanifold of IR” x £(7,M, IR”) and is diffeomorphic to 
the orthonormal frame bundle O(M/) C R"” x R”"*” via the map G, > O(M): 
(q,®) +> (g, ® 0 e) for any orthonormal frame e of 7,M. Since M is connected, 
Lemma 5.1.10 asserts that the map 


tp: 1(M) > Gp, tr(h) = (b(p),do(p)). (7.3.3) 


is injective for every p € M. Denote the image of this map by 


Tp = y(1(M)) = {(¢(p). db(p)) | @ € 1(M)} C Gp. (7.3.4) 


In the following theorem we do not assume that M is complete. For a space of 
smooth functions or maps on M we use the term C’™ topology to mean the topology 
of uniform convergence with all derivatives on each compact subset of M. Likewise 
we use the term C° topology to mean the topology of uniform convergence on each 
compact subset of M. The latter is also called the compact-open topology (because 
a basis of the topology consists of sets of maps, one for each compact subset of the 
source and each open subset of the target, that send the given compact subset of the 
source into the given open subset of the target). 
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Theorem 7.3.1 (Myers-Steenrod) Let M C R” be anonempty connected smooth 
m-manifold. Then the following holds. 


(i) There exists a unique smooth structure on the isometry group 1(M) such that 
the map tp :I(M) — Gp in (7.3.3) is an embedding for every p € M. The 
topology induced by this smooth structure agrees with the C° topology and 
with the C™ topology on 1(M) and dim(J(M)) < m(m + 1)/2. 

(ii) With the smooth structure in part (i) the maps 


1(M) x 1(M) > 1(M): (vb) > poe 
and 
1(M) > 1(M): $+ go"! 
are smooth. Thus 1(M) is a finite-dimensional Lie group. 


(iii) The Lie algebra of 1(M) is the space Vectx,.(M) of complete Killing vector 
fields (defined below). 


Proof See Sect. 7.3.4. Oo 


7.3.2 The Topology on the Space of Isometries 


The next lemma shows that for each p € M the set J, in (7.3.4) is a closed subset 
of G, and that the map t, : 7(M) — 4, in (7.3.3) is ahomeomorphism with respect 
to the C™® topology on 7(M). 


Lemma 7.3.2 Fix two elements po,qo € M, let ®) : T,,M — T,,M be an or- 
thogonal isomorphism, and let ¢; : M — M be a sequence of isometries. Then the 
following are equivalent. 


(i) The sequence t,,(¢;) € Ip, converges to the pair (qo, Po) € Gyo, Le. 
jim $i (Po) = qo, jim d¢i (po) = Po. (isa) 


(ii) There exists an isometry ¢: M — M such that $(po) = qo, €¢(pPo) = Po, 
and ¢; converges to @ in the C™ topology. 


Proof That (ii) implies (i) follows directly from the definitions. We prove in three 
steps that (i) implies (ii). For p ¢ M andr > 0 denote by V, C 7,M the do- 
main of the exponential map of M at p (Definition 4.3.5), and define U,(p) := 
{q € M|d(p,q) <r} and B,(p) := {v € T,M | |v| <r}. 
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Step 1 Assume (7.3.5) holds and choose a real number 0 <r < inj(po; M). 
Then r < inj(qgo; M) and ¢; converges on the open set U,(po) in the C™ topology 
to the isometry $9 := €XPy, 0 Poo exp, : U,(po) > U; (qo). 


Since r < inj(po; M), Corollary 5.3.3 asserts that 


B,($i(Po)) = 4¢; (Po) B, (Po) € 4i (Po) Vg © Vo; (po) 


and ; © €xp,, = EXPg,(p)) ° Pi : Br(Po) > U;(Gi(Po)). Thus the map 


EXPy,(p) = Pi SEXP, © P; | : B-($;(Do)) > Ur (4; (Po)) 


is a diffeomorphism and so r < inj(¢;(po); M) for each i € N. Since ¢;(po) 
converges to qo, this implies r < inj(qo; M) (Lemma 7.2.3). Hence the sequence 
of maps expy,,,) ° Pj converges to the map exp,, © ®p in the C™ topology 
on B,(po). Hence the sequence of isometries 


gi = EXP$;(p9) © Pj 0 EXD, :U,(po) > M 
converges in the C™ topology to the diffeomorphism 
oo := €XPy, © Qo o exp, : U,(po) > U;(qo). 


The map @p satisfies 
d(po(P), bo(g)) = lim d(gi(p), $:(9)) = (p,q) 


for all p,q € U,(po) and hence is an isometry (Theorem 5.1.1). This proves Step 1. 


Step 2 The sequence $; converges in the C™ topology on all of M to a smooth 
mapo: M > M. 


Define the set My := {p € M | the sequence (¢;(p), dg; (p)) converges}. This set 
is nonempty because po € Mo. We prove that Mo is open. Fix any element p € Mo 
and define g := lim; ¢;(p) and ® := lim;_... d¢; (p). Choose a real number r 
such that 0 <r <inj(p; M). Then Step 1 asserts that r < inj(g; M) and the sequence 
gilu,(p) converges to the diffeomorphism exp, o ® o exp,! : U,(p) — U,(q) in the 
C® topology on U,(p), and hence U,(p) C Mo. This shows that Mp is open, that 
gi |m) converges in the C®-topology to a smooth map ¢ : My > M, and that @(Mo) 
is an open subset of M. 

We prove that Mo is closed. Let p, € Mo be a sequence that converges to an 
element p € M. Then there exists areal number r > 0 such that r < inj(p,; M) for 
all v. Hence B,(p,) C Mo for all v by Step 1. Choose v so large that d(py, p) <r 
to obtain p € Mo. This shows that Mo is closed. Since M is connected, we deduce 
that Mo = M. This proves Step 2. 
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Step 3 The map ¢: M — M in Step 2 is an isometry. 


We claim that the triple po, go, ®o in (7.3.5) satisfies 
jim $7 '(Go) = Po, dim db; "(qo) = Bo". (7.3.6) 


Namely, the sequence d(¢;'(qo), po) = d(qo, $i (Po)) converges to zero by as- 
sumption, and by Step 1 the derivatives d¢; converge to d¢ uniformly in some 
neighborhood Up C M of po. Since ¢7'(qo) € Up for i sufficiently large, this 
implies that the sequence d¢;(¢; '(qo)) converges to df(po) = Bo and hence the 
sequence d¢;($;'(qo))| = dd; !(qo) converges to ®,'. This proves (7.3.6). 
By (7.3.6) and Step 2 the sequence ' converges in the C® topology to a 
smooth map y: M — M. By uniform convergence on compact subsets of M 
we have yo@ = id and doy = id, so ¢: M —> M is a diffeomorphism. More- 
over, since ¢; is an isometry for each i and converges pointwise to ¢, we 


have d($(p),$(q)) = lim; 0 (i (P). $i (q)) = a(p.q) for all p,q € M, so : 
is an isometry. This proves Step 3 and Lemma 7.3.2. 


The next goal is to verify that the spaces J, in (7.3.4) are diffeomorphic to each 
other. For po, p) € M define the map Fp, ,p) : In) > Ip, by 


Fp1.po( (Po), 4h (Po)) = (P(P1),db(pi), 6 € 1(M). (7.3.7) 


This map is well-defined by Lemma 5.1.10, because M is connected. Collectively, 
these maps give rise toa map ¥ : M x J — 7 defined by 


1 := {(6(p).do(p), p)| p € M, 6 € 1(M)} CG, 
F (Pi, (P(Po), dh ( Po). Po) = (P(P1), d(p1). P1) 


for po, Pp) € M and ¢ € J(M). The next lemma uses the concept of a smooth map 
on an arbitrary subset of Euclidean space as in the beginning of Sect. 2.1. 


(7.3.8) 


Lemma 7.3.3. The map F¥ : M x I > Tis smooth. In particular, for each pair of 
points po, p, © M, the map Fp, .p) : In) > Ip, is a diffeomorphism. 


Proof Let po, p; € M and choose a smooth path y : J = [0, 1] ~ M with the end- 
points y(0) = po and y(1) = p,. Let U, C G,, be the set of all pairs (qo, ®o) € Gp, 
such that there exists a development (9, y, y’) on the interval J that satisfies 


y'(0) = qo, P(t) = Do. (7.3.9) 


By Remark 3.5.22, the set U, is open in G,, and the map U,, — G that sends the 
pair (qo, ®o) € U, to the pair (y’(1), ®(1)) € Gp, is smooth. This shows that there 
is a unique diffeomorphism 


Fy Uy A, y ‘(t) :=y(Ql—9d), Us: = Gis Ut Ga, 
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that satisfies the condition 


Fy(v'(0), PO) = (VC), ®M) (7.3.10) 


for every development (®, y, y’) of M along M on the interval J. The inverse of F,, 
is the smooth map F,-1 : U,-1 > U,. If ¢@ € 1(M), then by Lemma 6.1.12 there 
exists a development (9, y, y’) on J satisfying the initial conditions y’(0) = $(po) 
and &(0) = d¢(po), and it is given by 


YO=¢7M), Pt) =dde(v@) (7.3.11) 
fort € I. Hence (d(po), db(po)) € U, and by (7.3.10) and (7.3.11) we have 


Fy($(Po), 4b(Po)) = (b(P1), 4 (P1)) = Fo1.09(P (Po). 4b (Po)) 


for every 6 € 1(M). Thus J,, C U, and J,, C U,-1 and Fy|1,, = Fp,,po- Hence 
Fp,po 18 smooth. The smoothness of F follows from the smooth dependence of the 
map F, on the curve y, the verification of which we leave to the reader. This proves 


Lemma 7.3.3. [I 


7.3.3 Killing Vector Fields 


Killing vector fields are defined as those vector fields on M whose flows are one- 
parameter families of isometries. Assume throughout that M is a nonempty con- 
nected smooth m-dimensional submanifold of R”. Let X be a vector field on M 
and denote by 


RxMDD—>M: (tp) ott, p) = ¢'(p) 


the flow of X (Definition 2.4.8). Then Theorem 2.4.9 asserts that D is an open subset 
of R x M and ¢ is smooth. Thus, for every t € R, the set D; := {p € M | (t, p) € D} 
is open in M and the map ¢' : D,; > D_, is a diffeomorphism with the inverse 
o':D_.—> Dy. 


Lemma 7.3.4 Jn this situation the following are equivalent. 


(i) Foreveryt € R the diffeomorphism ¢' : D; > D_,; is an isometry. 
(ii) For every p € M and every pair of tangent vectors u,v € T,M, we have 


(V,X(p), v) + (u, Vy X(p)) = 0. (7.3.12) 
Proof Let pe M and ve T,M. Choose a smooth curve a: R— M such 
that a(0) = p and &(0) = v, let 2 := {(s,t) € R*| (s,a(t)) € D}, and define the 
map y : 2— M by y(s,t) := ¢'(a(s)) for (s,t) € 2. Then 


d,y(0,t) = dd’ (p)v, aya=Xoy, V;0r¥ = Vay X(y). 
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Thus the formula V;d,;y = Vs0;y in Lemma 3.2.4 shows that 
V,do' (pv = Vag' (pw X($'(P)) (7.3.13) 
for all t € R and this implies 
d 2 
de (P)vl” = 2(Vaor nr X(b'(P)). db" (p)»). oo) 
The right hand side vanishes for all p, v, ¢ if and only if X satisfies (11), and the left 


hand side vanishes for all p, v, ¢ if and only if the flow of X satisfies (i). This proves 
Lemma 7.3.4. Oo 


Definition 7.3.5 A vector field X € Vect(M) is called a Killing vector field 
(named after Wilhelm Karl Joseph Killing [39]) iff it satisfies equation (7.3.12) for 
all p € M and all u,v € T,M. The space of Killing vector fields will be denoted 
by Vectx(M). 


The space of Killing vector fields is a vector subspace of the Lie algebra of 
all vector fields on M. The next lemma shows that it is a finite-dimensional Lie 
subalgebra. Part (11) is the linear counterpart of Lemma 5.1.10. 


Lemma 7.3.6 Let M C R" be a nonempty connected smooth m-manifold. Then 
the following holds. 


(i) If X is a Killing vector field on M and y : M — M is an isometry, then the 
pullback yy*X is a Killing vector field. If X and Y are Killing vector fields 
on M, then so is their Lie bracket |X, Y]. 

(ii) If X is a Killing vector field on M and there exists a py € M such that 


X(po) =0, VX(po) = 0, (7.3.15) 


then X(p) = 0 forall p € M. 
(iii) If M is complete and X is a Killing vector field, then X is complete. 


Proof We prove part (i). Let X be a Killing vector field, let 6’ : D; > D_,; be the 
flow of X, and let y : M — M bean isometry. Then 


vlodiop:w (Di) > ¥ (D1) 


is the flow of yw*X. Hence, by Lemma 7.3.4, the flow of y*X is a one-parameter 
family of isometries and so y*X is a Killing vector field. Now assume that Y is an- 
other Killing vector field and denote its flow by y’. Then each y’ is an isometry and 
so the pullback (w')*X is a Killing vector field for each t. Hence, by Lemma 2.4.18 
and Exercise 5.2.9, the Lie bracket [X, Y] = a o(¥')*X is also a Killing vector 
field. This proves (1). 
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We prove part (ii). Let 6’ : D; — D_, be the flow of a Killing vector field X 
and assume that there exists a po € M such that (7.3.15) holds. Then it follows 
from (7.3.13) that (po) = po and dg‘ (po) = id for all t. Hence, by Lemma 5.1.10 
the isometry ¢, :D; — D_, is the identity for each t € R. Thus D; = D_, = M 
for all t and X(p) = 0 for all p € M. This proves (ii). 

We prove part (iii). Thus assume that M is complete and X is a Killing vec- 
tor field. Let y : J — M be an integral curve of X on its maximal existence 
interval J C R and denote p := y(0). Differentiate the equation y(t) = X(y(t)) 
to obtain fio = 2(VyyX(y()), y(t) = 0 for all t €¢ J. Hence the func- 
tion ] > R:t + |y(¢)| is constant and this implies the inequality d(p, y(t)) < 
i ly(s)|ds = t|X(p)| for all t e 7. Since M is complete, the closed ball of ra- 
dius R about p is compact for every R > 0 (Theorem 4.6.5). Hence the restriction 
of y to any bounded subinterval of J is contained in a compact subset of M amd 
by Corollary 2.4.15 this implies 7 = R. This proves (iii) and Lemma 7.3.6. Oo 


The proof of Theorem 7.3.1 is somewhat analogous to the proof of the Closed 
Subgroup Theorem 2.5.27. The first parallel is in part (i) of the next lemma, 
which asserts that the space of complete Killing vector fields is a Lie subalgebra 
of Vect(M) and can be viewed as an anlogue of Lemma 2.5.29. Part (ii) is the 
analogue of Lemma 2.5.28. 


Lemma 7.3.7 


(i) The set Vectx,.(M) of complete Killing vector fields on M is a Lie subalgebra 
of Vectx(M). 

dai) LetRx M—> M :(t, p) & W;(p) be a smooth map such that YW, is an isom- 
etry for every t € IR and define X(p) := i |, 9 (p) for p € M. Then X isa 
complete Killing vector field. 


Proof The proof has three steps. 

Step 1 Let U,V C M be nonempty open sets, let 6 : U — V be an isometry, and 
let dy : M — M be a sequence of isometries that converges uniformly on every 
compact subset of U to ¢. Then @ extends uniquely to an isometry from M to M 


and $x converges in the C™ topology to this extension. 


Fix an element po € U. Then by part (1) of Exercise 5.1.11 we have 
jim bx (Po) = o(Po), im: dx (Po) = 46( Po). 


Hence the assertion of Step | follows from Lemma 7.3.2. 
Step 2 We prove part (ii). 


That X is a Killing vector field follows from the same argument as in Lemma 7.3.4 
with time dependent vector fields. Denote by ¢’ : D, — D_; the flow of X. Then 
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the sequence of isometries Wi) , : M — M converges to ¢' uniformly on every 
compact subset of D, (see Exercise 7.3.10 below). Hence Step 1 asserts that ¢' 
extends to an isometry on all of M whenever D, is nonempty, in particular for 
small t. The extended isometries still satisfy 6°t' = $° od’ and 0,¢' = X o ¢’. 
Thus D,; = M for small ¢ and so for all t, because P* (D_s N‘D;) C Ds 4; (Theo- 
rem 2.4.9). This proves Step 2. 


Step 3 We prove part (i). 


Let X,Y € Vectx,.(M), let ¢' be the flow of X, and let w’ be the flow of Y. Then 
4 of 2 V' (Pp) = X(p) + Y(p) for all p € M, and so X + Y is a complete 
Killing vector field by Step 2. Hence Vectx,-(M) is a vector space. It is finite- 
dimensional by Lemma 7.3.6. Moreover, (Ww 0 ¢* 0 W')seR is the flow of the 
pullback vector field (w’)*X, hence (w’)*X is a complete Killing vector field 
for each ¢ € R, and hence so is the Lie bracket [X, Y] = flow)" X by finite- 
dimensionality. This proves Step 3 and Lemma 7.3.7. Oo 


Exercise 7.3.8 Let M C R” be a smooth manifold and let X be a vector field 
on M. Consider the following condition. 


(K) [fy : I — M is a geodesic, then X oy € Vect(y) is a Jacobi field along y, i.e. 
it satisfies the differential equation VV(X oy) + R(X oy, y)y = 0. 


Prove that every Killing vector field satisfies (K) and use this to give an alternative 
proof of part (ii) of Lemma 7.3.6. If M is compact, prove that a vector field satis- 
fies (K) if and only if it is a Killing vector field. Find a vector field on a noncompact 
manifold that satisfies (K) but is not a Killing vector field. Hint: Differentiate the 
function tf +> (X(y(t)), y(t)) twice. 


Exercise 7.3.9 (Grénwall’s inequality) Jfa,c >0ando: [0,T] > R is a con- 
tinuous function satisfying 


t 
0<o(t)<at c f as) ds for0 <¢t <T, (7.3.16) 
0 


then o(t) < ae“ for every real number 0 < t < T. Hint: The function 


t 


T(t) := [cee —a(s))ds 


0 


satisfies t(t) >ct(t) for all t. Differentiate the function t + e~ t(t) to show that t 
is nonnegative and nondecreasing. 
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Exercise 7.3.10 Let 7T,c,¢ be positive real numbers, let M C R” be a smooth 
m-dimensional submanifold, and let [0,7] x M — R": (t, p) & X;(p) and 
[0, T] x M > R": (t, p) > Y;(p) be continuous maps that satisfy the condi- 
tions X;(p), ¥:(p) € T,M and 


IX; (p) — Y;(p) IR" < €, IX:(p) — X:( pn < clP — @|R* 317) 


for all p,q € M and allt € [0, T]. If the curves 8, y : [0, 7] > M are solutions of 
the differential equations 6(t) = X,(B(t)) and y(t) = Y;(y(t)), prove that 


IB(t) -—y@I|r < (10) — y(0)|Rx + Te)e" (7.3.18) 


for 0 < t < T. Relax the continuity requirement in ¢ on the vector fields to piece- 
wise continuity. Hint: Define the function o : [0,7] — R by 


o(t) := [BO —yOle. 


Show that 
t 
o(t) <o(0)+te+c / o(s)ds 
0 
and use Gronwall’s inequality. 


7.3.4 Proof of the Myers-Steenrod Theorem 


With these preparations we are ready to prove Theorem 7.3.1. The following lemma 
is the heart of the proof. It is the analogue of Lemma 2.5.30 in the proof of the 
Closed Subgroup Theorem 2.5.27. 


Lemma 7.3.11 Fix an element po € M, a tangent vector vo € T,, M, and a linear 
map Ag: Ty,M — T,,M. Let ¢; € 1(M) be a sequence of isometries and let t; be 
a sequence of positive real numbers such that 


lim 7=0, — lim $;(po) = po. ~—_lim d@;(po) = 1, (7.3.19) 


100 


and, for allv € T,,M, 


im # (Po) = Po _ . Adi (po)v — v 
im —~————- = up, im —— 


i—0o Tj i—oo Tj 


=e (7.3.20) 


Then there exists a unique complete Killing vector field X on M such that 


X(po) = v0, 4X(po) = Ao. (7.3.21) 
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For every p © M and every v € T,M this vector field satisfies 


lim 


i—0oo 


gi(P)-— Pp _ dp;(p)v —v 
FUME peaks Lda 
Tj Tj 


X(p), ii = dX(p)v, (7.3.22) 


and the convergence is uniform on compact subsets of TM. 


Proof By (7.3.19), (7.3.20), and Exercise 2.2.4 we have 


(po, Ao) € T(p9.1)Gpo- 


Let p € M and recall the definition of the map F, p, : Ip) > Jp in (7.3.7). By 
Lemma 7.3.3 this map extends to a diffeomorphism F, from an open sub- 
set U, C Gp, containing the set J,, to an open subset U,-1 C G, containing 
the set J, and it satisfies F,(po, 1) = (p, 1). Define 


(X(p), A(p)) := dF, (Po, 1)(vo, Ao) € Tip.1)Gp- 


Since Fy (9; (Po), 46; (Po)) = (¢; (Pp), d¢; (p)) for all i, it follows from (7.3.20) and 
Exercise 2.2.16 that, for all v € T,M, we have 
lim = A(p)v. (7.3.23) 


X(p), li 


1-00 


Pi(P)— P _ _ dgi(p)v — v 
Tj 7 Tj 
This formula shows that the pair (X(p), A(p)) € T(p,1)Gp is independent of the 
choice of the extension ¥, used to define it. Moreover, it follows from the smooth- 
ness of the map F in Lemma 7.3.3 that the map p +> (X(p), A(p)) is smooth and 
that the convergence in (7.3.23) is uniform as the pair (p, v) varies over any com- 
pact subset of TM. 
Now let v € 7,M and define y(t) := exp, (tv) for ¢ € J,.,. Then 


X(y(0) — X(p) = im $i v) =). lim oP) =p 
= fe (v@)) — (VO) — y@) + yO) 
~ i>oo Tj 
ita i dd; WOO) — p(s) ds 

I “ T; 
= f Aonroras. 


0 


Here the last step uses uniform convergence on compact sets in (7.3.23). Divide by f 
and use the continuity of the curve t + A(y(t))y(¢) to obtain 


X(exp, (tv) — X(p) 
. 


1 t 
A(p)o = tim = / A(y(s))9(8) ds = lim 
0 
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Hence, by Exercise 2.2.16, we deduce that, for all p € M andallv € 7,M, 
A(p)v = dX(p)v. 


By (7.3.20) and (7.3.23) this shows that X satisfies (7.3.21) and (7.3.22). 
By (7.3.22) and Exercise 2.2.4 we have (X(p),dX(p)) € Tip1)Gp and this im- 
plies (v, dX(p)v) = 0 for all v € T,M. Here is a more direct proof of this crucial 
fact. Namely, by (7.3.22) we have 


. |dgi(p)v — v|? 
jim ——,—— = |dX(p)u)’. 
Hence 
eaten =e Oe 
1-0 Tj 
= som (2: 44i(P)) = LWP 
= lim 
1—->0o Tj 
=i 2(v, dd(p)v) — |v? — |dg;(p)v|? 
= lim 
p=F00 2; 
= li |\dbi(p)v = v|? Tj 
= — lim —~.——_-> 
i>oo T; 2 
=0 


for all p € M and all v € T,M. This shows that X is a Killing vector field and so 
it remains to prove that X is complete. 
Let ¢' : D; — D_; be the flow of X and, for p € M, denote by 


Ip := {t€R|peD,} 


the maximal existence interval for the solution y of the initial value problem 
y(t) = X(y(t)), y(O) = p. We prove in three steps that X is complete. 


Step 1 Let p € M and let T > 0 such that T|X(p)| < inj(p; M). Then 
[-T,T] C Ip. 


Define y(t) := ¢'(p) for t € J,. Since X is a Killing vector field, the diffeomor- 
phism ¢' : D, — D_, is an isometry by Lemma 7.3.4, and hence 


IP] = |X" (P))| = ldo" (P)X(p)| = |X(P)I, 


for all ¢ € I,. This implies d(p, y(t)) < t|X(p)| for all t € J,, and hence y cannot 
leave the compact set Ur) x(p)\(p) = {¢ € M | d(p.q) < T|X(p)|} on any subin- 
terval J C [—T, T]. Hence [—T, T] C J, by Corollary 2.4.15 and this proves Step 1. 
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Step 2 Let p € M. Ift € R satisfies |tX(p)| < inj(p; M) and the sequence of inte- 
gers m; € Z is chosen such that 


mit <t < (mj; + 1)t;, (7.3.24) 
then $'(p) = limo $;" (p). 
Let T > 0 such that T|X(p)| < inj(p: M). Then [0, 7] C J, by Step 1 and the set 
K := {¢'(p)|0<t <T} 


is compact and | X(q)| = |X(p)| for all g ¢ K. By Lemma 4.2.7 there exists a con- 
stant 5 > 0 such that, for all g,go,q1 € M, 


d(qo. 41) 


qéK, d(q.qo) <6, d(g.q) <6 = 
lgo- 1 


28. 795) 


Fix a real number 0 < e < 1 and choose ig € N such that, for all i > io, 


tj |X(p)| <6, (1 +1%)|X(p)| < inj(p: M), sp akagla)) = 8; (7.3.26) 
qe 


sup 
qeK 


— X(q)| <é, sup 


qeK 


gi a q —X(q)|<e. (7.3.27) 


o'(qM—4 

Tj 
Then, for all 7 > ig and all g € K, we have d(g,¢@"%(q)) < t|X(p)| < 6 and 
d(q, ¢;(q)) < 6 by (7.3.26), and hence by (7.3.25) and (7.3.27), 


sup d(¢;(q). ¢" (q)) < 2 sup|¢;(q) — 6" (q)| < 4tie. (7.3.28) 
qeK qeK 


Now choose a real number 0 < t < T and choose m; € No as in (7.3.24). Then, 
fork =1,..., m; — | we have ¢*%(p) € K and hence 


d(4i*"(p), 6" (p)) 
< d($:(6 (P)). 6: (6*" (W))) + d(:(O** (D)).° (8° (D))) 
= d(ok(p).6** (p)) + d(gi(6*" (P). 6" (0 (D))) 
< d(4!(p). 6" (p)) + 4n6. 
Here the last inequality holds for i > i by (7.3.28). By induction this implies 


d(p;"(p).¢"'"(p)) < 4m; te < 4Te 
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for all i > ip. Hence 
Jim $7" (p) = lim 9" (p) = $'(p) 
and this proves Step 2 for f > 0. Fort < 0 the argument is similar. 
Step 3 /, = R for every p € M. 


Fix an element p € M and real number T > 0 such that T|X(p)| < inj(p; M). 
Choose the sequence m; € No such that t;m; < T < (m; + 1)t;. Then Step 2 as- 


serts that 67 (p) = lim;_,~ ;" (p). Since ¢;"" : M > M isan isometry, it follows 
that T|X(p)| < inj(¢;" (p); M) for alli € N and hence 


T|X(p)| < inj(o" (p); M). 
By Step 1 this implies [0, 7] Cc gr (p). Hence [0,27] C J, and, by Step 2, 
g°'(p) = lim $5" (6"(p)). 
Continue by induction to obtain for all k € N that 
T|X(p)| < inj(¢*" (p); M) 
and hence [0, (k + 1)T] C J, and 
pO (p) = lim 7" (4 (p)). 


Thus [0,00) C J, and the same argument shows that (—oo, 0] C J,. Hence X is 
complete. This proves Step 3 and Lemma 7.3.11. oO 


The next lemma establishes the smooth structure on J7(M), in analogy to the 
proof of the Closed Subgroup Theorem 2.5.27. 


Lemma 7.3.12 For every p € M the set 1, is a smooth submanifold of G, and its 
tangent space at (q,®) € 1, is given by 


T(q.0)Ip = {(X(q),dX(q)®) | X € Vectx,<(M)}. (7.3.29) 


Proof By Lemma 3.4.5 with R” replaced by TM the tangent space of G, at an 
element (¢, ®) € G, is given by 


G € T,M, © € L(T)M,R"), 
Tia.0)Gp = } G.®) | (1 — 1(q)) ® = hg @®, and . (7.3.30) 
(fu, hv) + (Ou, Gv) =OV u,v € T,M. 
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If (¢,®) € G, and X € Vect(M), then (1 — IT(q))dX(q)® = h,(X(q))® by the 
GauB—Weingarten formula in Remark 5.2.5. Moreover, if X is a Killing vector field, 
then (@u,dX(q)@v) + (dX(q) Gu, Sv) = 0 for all u,v € T,M, so it follows 
from (7.3.30) that the pair (G, ®) with G = X(q) and @ = dX(q)@ is a tangent 
vector of G, at (q, ®). 

Now abbreviate 


m(m + 1) 


k := dim(Vectx,.(M)) < 5 


= dim(G,) =: €. 
Fix an isometry ¢o : M — M and define 


(qo, Po) := (¢o(P). dho(P)) = 'p (go) € Ip. 


We must construct a coordinate chart on G, in a neighborhood of the point 
(qo, ®o) € J, with values in an open set 2 C R¢ that sends J, to the intersec- 
tion of 2 with R* x {0}. For this we first choose a basis Y;,..., Y; of Vectx..(M) 
and then a basis n; = (q, ®),....1 = (Gy, ®,) of the tangent space Tiga Ge 
such that 


nj = J. ®)) = (¥j(olP)). 4 ¥;(bo())dgo(p)). J = Lave ke 
Next we choose any smooth map 
Re GC est) 


such that 1(0) = (qo, ®o) and 
—0,....0=74=G,9;), jfHk+...€ 


For a complete Killing vector field X € Vectx,.(M) let wy € J(M) be the time-1 
map of the flow of X and define the map @ : R’ > Gp by 


A(t...) = (Wx@). dx), 
X:= OY +---+0°Y%, (7.3.31) 
(q.®) := Ue ssaged) 


for (e's . .,t°) € R°. Then 


O(0) = (0) = (qo, Po) = tp (po) € Tp 
and 


dO) (7',....7) =T mm +2 +7 ne 
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for ('. dist a) € R°. Thus the derivative d@(0) : R' > T(qo.8))Gp is a bijective 
linear map and so the Inverse Function Theorem asserts that the map @ restricts to a 
diffeomorphism from a sufficiently small open neighborhood 2 C R* of the origin 
onto its image 1($2) C Gp, which is an open neighborhood in G, of the point O (0) = 
tp(¢o) € Jy. With this understood, the assertion that J, is a smooth submanifold of G, 
is a direct consequence of the following Claim. 


Claim There exists an open set 2 C R° such that 


0€ 2c 2, @(2oN (R* x {0})) = Uo NJ, Uo := O(N). 
(7.3.32) 


Suppose, by contradiction, that such an open set $29 does not exist. Then there exist 
sequences ft; = (¢; siege) € R“ and ¢; € 71(M) such that 


jim t = 0, t; € 2 \ (R* x {0}), O(t:) = tp(4:) € Ip- 
Define 
X= HY, 4-.-4+0°%;, (gi, ®:) = (th*!,..., tf). 
Then (Wx; (qi), dWx; (qi) Pi) = O41) = (bi) = (Gi (P), 44; (p)) and hence 
gi = (Wy) 0 bi)(P), ®; = dy, (qi) 'ddi(p) = d(yy} © $i)(p). 


Thus tp (Wy! 0 $7) = (qi, &) = O(0,...,0, tk*1!....,t/) is still a sequence in the 
set O(Q \ (R* x {03))/N 1, that converges to t,(¢o). Thus we may assume without 
loss of generality that t} = iL == iM = 0 and so 


L(t;) = tp(@i), jim Ly (Pi) = tp (ho). (7.3.33) 


Then ¢; 0 ¢ 9! converges to the identity in the C® topology by Lemma 7.3.2. 
Since ¢; 4 go, we have 


d(¢; 0b! = 
4 = (G2 5O=0/4 ip “OOO I=". 
ofvET,>M |v| 


0 (7.3.34) 


for alli by Lemma 5.1.10, and lim;_,,, t; = 0. Hence, by Exercise 2.2.4 there exists 
a tangent vector (v9, Ao) € T(p.1)Gp and a subsequence, still denoted by ¢;, such 
that, for all v ¢ 7,M, 


Aov. (7.3.35) 


im og)'\(p)— Pp _ : fam ogy '\(pyu-v _ 
. 05 nee 4 ~ 


i>oo T; i>oo Tj 
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It follows from (7.3.35) and Lemma 7.3.11 that there exists a complete Killing 
vector field X € Vectx,-(M) such that 


X(p)=v,  dX(p) = Ao. 
This vector field is nonzero because (vo, Ao) 4 0 by (7.3.34). Moreover, by equa- 
tion (7.3.22) in Lemma 7.3.11, with ¢; replaced by ¢; © oe and the pair (p, v) 
replaced by the pair (¢o(p), dgo(p)v), we obtain 


. Pil(P) — do(P) 
m AO 


X(g¢o(P)) = jim : 
dd; —d 
AX (bo(p))dbo(p)v = lim gi (p)v : do(p)v 


for v € T,M. Since by (7.3.33) the sequence 
($i (P). 4Gi(P)) = (Gi) = Uti) 


converges to (fo(p). dbo(p)) = «(0), the pair (X(bo(p)). dX (bo(P))dbo(p)) be- 
longs to the image of the derivative di(0) by Exercise 2.2.4, and hence is a nonzero 


linear combination of the vectors nz41,..., ¢. It is also a linear combination of the 
vectors 71,...,%;, because X is a complete Killing vector field and so is a linear 
combination of the vector fields Y,,..., Y;. This is a contradiction, and this con- 


tradiction proves the Claim. Thus there does, after all, exist an open set 29 C Ré 
that satisfies (7.3.32), and the map O~! : Up — Mp is then a coordinate chart on Gp 
which satisfies 


O|!(Up NI,) = 2o M1 (RK x {0}). 


Hence 7, is a submanifold of G, and this proves Lemma 7.3.12. O 


Proof of Theorem 7.3.1 By Lemma 7.3.3 and Lemma 7.3.12 there exists a unique 
smooth structure on 7(M) such that the map 1, :7(M) — G, in (7.3.3) is an embed- 
ding for every p € M. That the topology induced by this smooth structure agrees 
with the C™ topology was established in Lemma 7.3.2. That it also agrees with 
the compact open topology (i.e. with the C° topology of uniform convergence on 
conpact sets) is the content of Exercise 5.1.11. This proves part (i). 

We prove part (ii). Recall the definition of the map 


F:MxI-tI 
in (7.3.8) and the target map t : G > M, the inverse mapi : G — G, and the com- 


position map m : (s x t)~'(A) > G in the beginning of Sect. 7.3.1. These maps are 
all smooth and turn 7 into a smooth subgroupoid of G. For each p € M they endow 
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the submanifold J, C G, with the structure of a Lie group as follows. The unit is 
the element 


ep := (p.1) € J, (7.3.36) 


The product is the map m, : J, x J, — J, defined by 


(p(n. €). p) = m(F (eE. p). (n. P)). (EP) (7.3.37) 
for &, € 7,, and the inverse map is the involution 7, : J, — 7, defined by 


(ip(E), p) :-= F(p.ig, p)) (7.3.38) 


for € € J,. The maps m, in (7.3.37) and /, in (7.3.38) are smooth by definition. To 
show that they define a group structure and that the map 


lp: 1(M) > 1, 
in (7.3.3) is a group isomorphism, we must verify the identities 


tp (id) = ep, 
Mp(tp(W), lp()) = ly (y og), (7.3.39) 
ip (tp(@)) = Ce) 
for all 6, w € J(p). The first equation in (7.3.39) follows directly from the def- 


initions. To prove the remaining equations, fix two isometries @ and wy and de- 
fine &,y,¢ € 7, by 


g = lp (@), VS lp (W), g — p(w op). 


Then, by the chain rule, 


(6p) = (WGP). dvG(P))d9(P). P) 
m|(v(b(p)). dV ((P)). (P)). ($(). d4(P), P)) 
m(F(o(p).(W(p), dW). P)). ((P). d4(P). P)) 


m(F(E. Pp). (n, p)), (8, »)) 
(mp(n, €), P)- 


Here the last equality follows from the definition of the map m, in (7.3.37). This 
proves the second equation in (7.3.39). 
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To verify the third equation in (7.3.39), we compute 


(p($'). P) = (6° '(P), dg"(P), P) 

= F(p.($ '(b(p)), dd" (¢(P)), $(P))) 

= F(p.(p.do(p) |, 6(p))) 

= F(p.i(¢(p).d$(p), p)) 

= F(p.i(&. p)) 

= (ip), P). 
Here the last equality follows from the definition of the map 7, in (7.3.38). This 
proves the third equation in (7.3.39) and part (11). 


That the Lie algebra of J, is isomorphic to the space Vectx,-(M) of complete 
Killing vector fields under the Lie algebra isomorphism 


Vectx,<(M) > T., Jp : X +> (X(p),dX(p)) 


follows from Lemma 7.3.12. This proves part (iii) and Theorem 7.3.1. A 


Corollary 7.3.13 Let (p,e) € O(M). Then there exists a constant ¢ > 0 with the 
following significance. If 6 : M — M is an isometry that satisfies 


d(p,b(p)) <é, |e — db(p)e|Ruxm < 8, 


then there exists a complete Killing vector field X € Vectx,-<(M) whose flow has 
the time-1-map fy = ¢. 


Proof Use the construction of the map © : 2 — G, and the Claim in the proof of 
Lemma 7.3.12 with ¢p = id to obtain ¢ € O(2 /N (R* x {0})). Oo 


7.3.5 Examples and Exercises 


Throughout we denote by Jo(M) C 1(M) the connected component of the identity 
in the group of isometries of a Riemannian manifold M. 


Example 7.3.14 The isometry group of IR” is the group of all affine transforma- 
tions of IR” with orthogonal linear part (Exercise 5.1.4). 


Example 7.3.15 We have seen in Example 6.4.16 that the isometry group of the m- 
sphere S” C Rt! is the group 7(S”) = O(m + 1) of orthogonal transformations 
of the ambient space. In the case m = 2 a theorem of Smale [72] asserts that the 
inclusion O(3) = 7(S*) <> Diff(S7) of the isometry group of the 2-sphere into the 
infinite-dimensional group of all diffeomorphisms of the 2-sphere is a homotopy 
equivalence. 
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Example 7.3.16 In Sect. 6.4.4 we have introduced the hyperbolic space H’”. Its 
isometry group is the group J(H”) = O(m, 1) of all linear transformations of R””*! 
that preserve the quadratic form Q in (6.4.9) (Exercise 6.4.25). In the case m = 2 
the identity component of this group is isomorphic to the Lie group PSL(2, R) = 
SL(2, R)/{+1}. Geometrically this can be understood by examining the upper half 
space model of H? (Exercise 6.4.27). 


The preceding three examples are the constant sectional curvature manifolds dis- 
cussed in Sect. 6.2 and Sect. 6.4. In all three cases the isometry group has the 
maximal dimension dim(7(M)) = m(m + 1)/2 and is diffeomorphic to the or- 
thonormal frame bundle O(M), so these examples satisfy the condition J, = G, 
in the notation of Sect. 7.3.1. By Corollary 6.4.13 a complete, connected, simply 
connected manifold M satisfies J, = G, if and only if it has constant sectional cur- 
vature. 


Exercise 7.3.17 Consider the incomplete 2-manifolds 
My := R? \ {(0, 0)}, M, := R*\ 2’. 


Prove that for i = 0,1 every isometry of M; extends to an isometry of R* and 
so 1(M;) is a subgroup of the Lie group 7(IR7) of affine maps with orthogonal 
linear part (Example 7.3.14). The isometry group of Mo is isomorphic to the Lie 
group O(2). The isometry group of Mj, is discrete and is an example of a so- 
called wallpaper group (of which there are 17). The Lie algebra of Killing vector 
fields in both cases has dimension 3. Which Killing vector fields are not complete? 
Hint: A Killing vector field on a connected open set M C R? is a smooth map 
M > R*: (x,y) & (u(x, y), v(x, y)) that satisfies the equations d.u = dv = 0 
and 0,u + 0,v = 0. Deduce that the map (u, v) is affine and has a skew-symmetric 
linear part. 


Example 7.3.18 The identity component of the isometry group of the complex 
projective space CP” (Example 2.8.5) with the Fubini-Study metric (Exam- 
ple 3.7.5) is the group 


Jo(CP") = PSU(n + 1) = SU(n + 1)/Z(SU(n + 1) & Un + 1)/U(). 


Here Z(SU(n + 1)) = {Al|A € S!, Ant! = 1} & Z/(n + 1)Z is the center of 
the group SU(n + 1). We emphasize that the dimension n(n + 2) of the isometry 
group 7(CP") is smaller than the dimension n(2n + 1) of the orthonormal frame 
bundle O(CP”) unless n = Oorn = 1. 

In the case n = 1 the projective line CP’ is isometric to the 2-sphere by stere- 
ographic projection (Exercise 2.8.13 and Example 3.7.5). Hence the identity com- 
ponent PSU(2) of the isometry group of CP’ is isomorphic to the identity compo- 
nent SO(3) of the isometry group of S*. An explicit isomorphism is discussed in 
Exercise 2.5.22. 
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The full isometry group 7(CP”) has two connected components. In the case 
n = 2 it is an open question whether the inclusion 7 (CP?) <> Diff(CP’) of the 
isometry group of CP” into the group of all diffeomorphisms of CP? is a homotopy 
equivalence. By deep results of Gromov [21] and Taubes [74] a positive answer to 
this question is equivalent to the assertion that the space of symplectic forms on CP? 
in a fixed cohomology class is contractible (the symplectic uniqueness conjecture 
for CP’). It is not even known whether this space is connected or, equivalently, 
whether every diffeomorphism of CP” that induces the identity on cohomology 
is isotopic to the identity. For a more detailed discussion see [67, Example 3.4] 
and [49, Example 13.4.1]. 


Example 7.3.19 The identity component 7)(S? x S?) of the isometry group 
of the product manifold M = S? x S? is the product group SO(3) x SO(3). In 
contrast to Smales’ Theorem the inclusion Jo(S? x S?) <> Diffy(S? x S$”) into 
the group of diffeomorphisms of S? x S? that are isotopic to the identity is not 
a homotopy equivalence. For example, if ¢,.9 : S? — S? denotes the rotation 
about the axis through x € S? by the angle 6 € R/2zZ and the diffeomor- 
phism Wp : S* x S? — S? x S? is defined by 


Wo(x, ¥) = (x, bx6(¥)), x,yeS’, 


then the loop R/22Z — Diffo(S* x S*) : 8 +> Wo is not contractible and neither is 
any of its iterates. Thus Diffo(S? x S*) has an infinite fundamental group while the 
fundamental group of SO(3) x SO(3) is finite. For S* x S? it is an open question 
whether every diffeomorphism that induces the identity on cohomology is isotopic 
to the identity. For a more detailed discussion see [67, Example 3.5] and [49, Ex- 
ample 13.4.2]. 


7.4 lIsometries of Compact Lie Groups* 


In the following theorem we denote by Jo(M) the connected component of the 
identity in the group of all isometries of a manifold M. 


Theorem 7.4.1 Let G C GL(n, R) be a compact connected Lie group equipped 
with a bi-invariant Riemannian metric. If @ € Ip(G), then there exist elements 
a,b € G such that 


$(g) = dan(g) = agbh' forallg €G. (7.4.1) 


The proof of Theorem 7.4.1 makes use of the Killing form introduced in Exam- 
ple 5.2.25. Recall the definition of the center Z(g) in Exercise 2.5.34 and of the 
commutant [g, g] C g in Exercise 5.2.24. The heart of the proof is Lemma 7.4.3. 
The case where the Lie algebra has a nontrivial center is then dealt with in 
Lemma 7.4.5. 
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Lemma 7.4.2 Let g be a finite-dimensional Lie algebra that admits an invariant 
inner product and has a trivial center. Then the Killing form on q is nondegenerate. 


Proof Exercise 5.2.27. Oo 


Lemma 7.4.3 Let g be a finite-dimensional Lie algebra and assume that the 
Killing form on q is nondegenerate. Then the following holds. 


G) Z(g) = {0} and [g, 9] = g. 

(ii) The Lie algebra homomorphism ad : g — Der(g) is bijective. 
(iii) Every derivation 5 : g¢ — g has trace zero. 

(iv) Let 6 : g — g bea linear map such that, for all €,y,€ € g, 


SUE. m}.o] = U158, n]. o] + ULE. dn], o] + (IE. 0]. 82]. (7.4.2) 
Then there exists a unique element &; € q such that 5& = [&;, &] for all € € q. 


Proof We prove part (i). If & € Z(g), then ad(€) = 0, hence x(&,7) = 0 for 
all 1 € g, and hence € = 0 by nondegeneracy of the Killing form. To prove 
that [g, gq] = g, assume that A: g— R is any linear functional that vanishes 
on the commutant [g,q]. Since the Killing form is nondegenerate, there exists 
an element € € g such that A = «(¢,-). Hence 0 = x (, [E, n]) = «((f, €], n) for 
all &,1 € g. Since the Killing form is nondegenerate, this implies [¢,&] = 0 for 
all € € g, hence € € Z(g), hence ¢ = 0, and hence A = 0. This proves (i). 

We prove part (ii). The map ad: g — Der(q) is injective by part (i). Choose 
a basis &,...,& of g. Since the Killing form is nondegenerate, there exists 
a dual basis 7),...,7% of g such that «(&;,n;) = 6); for all i, 7. These bases 
satisfy € = )°, «(n;,0)& for all € € gq. Now let 5 € Der(q) and define & := 
>; trace(5 ad(&;)) n; € g. Then, for all € € g, we have 


trace(5 ad(¢)) = Yelm, £)trace(6 ad(&;)) =«K(&,0) = trace(ad(€)ad(¢)). 


t 


Thus the derivation ¢ := 6 — ad(&) satisfies trace(e ad(€)) = 0 for all € € g. Since 
[e, ad(7)] = ad(en), this implies 


(en, 6) = trace(ad(en)ad(¢)) = trace([e, ad(n)]ad(¢)) 
a trace(e[ad(n), ad(£)]) a trace(é ad([n, ¢))) =0 


for all n,¢ € g. Hence ¢ = 0 by nondegeneracy of the Killing form and hence 
6 = ad(&). This proves (ii). 

We prove part (ili). Since trace(ad({é, 7])) = trace([ad(&), ad(y)]) = 0 for 
all &,n € g and [g,g] = g by part (i), we have trace(ad(€)) = 0 for all & € g. 
Hence (iii) follows from part (ii). 
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We prove part (iv). Define the bilinear map Bs : g x g > g by 


Bs(§.n) = 4[&. n] — [68. n] — [§. 6m] (7.4.3) 
for €, € g. Then equation (7.4.2) can be expressed in the form 
Bs(1&. nl. 6) + [Bs(&.n), ¢] = 0 (7.4.4) 
for &,y,¢ € g. By part (i) there exists a basis of g consisting of vectors of the 
form e; = [&,n;]. Define the linear map As: gq —> gq by Ase; := —Bs(&,7;). 
Then Bs(e;,¢) = [Ase;, ¢] for all i and ¢ by (7.4.4). Hence 
Bs(§,n) = [As&, n] = [§, Asn] (7.4.5) 


for all &, 7 € g. Here the second equality holds by the skew-symmetry of B;. By the 
Jacobi identity and equations (7.4.4) and (7.4.5) we have 


2[Bs(&. 7), 6] = [[E, Asn]. 6] + [[As&. nl. ¢] 
= —[[¢, &], Asm] — [[Asn, €], €] + (L458, 1], 6] 
= —Bs((¢,&],m) — [Bs(7. 2), €] + [[Asé. nl]. 6] 
= [Bs(¢.&).n] + Bs(In. 2], €) + [[4s6. nl]. ¢] 
= [[¢, As&], 7] + [In. €1, Asé] + [[As8, n]. o] = 0 
for all &,,¢ € g. Since Z(g) = {0} by part (i), we find that B;(&,7) = 0 for 


all €,7 € g, hence 6 is a derivation, and hence 6 is in the image of the map 
ad : g — Der(g) by part (ii). This proves (iv) and Lemma 7.4.3. Oo 


Lemma 7.4.4 The assertion of Theorem 7.4.1 holds under the additional assump- 
tion that the center of the Lie algebra g = Lie(G) is trivial. 


Proof Let ¢@ €1)(G) and choose a smooth isotopy [0,1] > Jo(G): tbh ¢, 
joining the identity ¢) =id to ¢, = ¢. For 0<t <1 define the diffeomor- 
phism w,; : G > G by W,(g) := ¢,(1) '@;(g). Then each w; is an isometry and 
the path [0, 1] + Jo(G) : t > vy; satisfies 


Yo=id, wd)=1 


for all t. By Theorem 5.3.1 the derivatives YW, := dy,(1) : g > g preserve the 
Riemann curvature tensor of G at 1 and by Example 5.2.18 (for the standard metric 
on Lie subgroups of O(77)) and Exercise 5.2.22 (for general bi-invariant Riemannian 
metrics) this translates into the condition 


WALLE. mn]. o] = [WE Yen], % EC] (7.4.6) 


for all ¢ and all &,7,¢ € g. Hence the endomorphism 6, := vty, :g3>g 
satisfies (7.4.2) for each t. Moreover, by Lemma 7.4.2 the Killing form on g is 
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nondegenerate. Hence it follows from Lemma 7.4.3 that there exists a smooth 
path [0, 1] > g: f+ & such that ¥-! LW, = ad(é,) for all t. Thus 


d 
at = dé), HHL (7.4.7) 


Now let [0, 1] ~ G: t + 5, be the solution of the differential equation 


d 
Gt = bi, bo =, (7.4.8) 


and define ®, := Ad(b,) (Example 2.5.23). Then £8, = @,ad(&,) for all ¢ 
and ®) = 1. Thus @, = ¥Y and so 


W,n = bynb;' (7.4.9) 


for all t and 7. Take t = 1, define b := b,, and use Lemma 5.1.10 (uniqueness of 
local isometries) to obtain ¥(g¢) = bgb~ for all g € G. Hence 


6(g) = éDwWi(g) = o()bgb! = agh" 


for all g € G with a := $(1)b. This proves Lemma 7.4.4. Oo 


We will denote by Z(G) C Z(G) the connected component of 1 in the cen- 
ter of G. Then Zo(G) C G is a compact connected abelian Lie subgroup of G 
(Exercise 2.5.34). Since G is connected, its Lie algebra is the center of gq, ie. 
Lie(Zo(G)) = Z(g). 


Lemma 7.4.5 Let G C GL(n, R) be a compact connected Lie group with a bi- 
invariant Riemannian metric and let @ € Io(G). Then 


p(hg) = b(gh) = $(g)h = hd(g) (7.4.10) 
forall g € Gandallh € Z(G). 
Proof The proof relies on the following basic observations. 


(a) Zo(G) is a compact connected Lie subgroup of G and Lie(Zo(G)) = Z(q). 

(b) exp(§ + 9) = exp(&) exp(7) for all &, 7 € Z(g). 

(c) The exponential map exp : Z(g) — Zo(G) is surjective. 

(d) A := {&€ € Z(g)| exp(&) = 1} is a discrete additive subgroup of Z(g) which 
spans Z(q). 


Part (a) was noted above, part (b) follows from Exercise 2.5.39 because the Lie 
algebra Z(q) is commutative, and part (c) follows from the Hopf—Rinow The- 
orem 4.6.6. That the set A is an additive subgroup of Z(q) follows directly 
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from (b). Moreover, by (a) and (b) the exponential map restricts to a local dif- 
feomorphism exp : Z(g) — Zo(G). Hence A is discrete and by (c) the exponential 
map descends to a proper map from Z(q)/A onto Z(G). Since Z(G) is compact, 
the lattice A spans the vector space Z(g). 

Now assume (1) = 1 and define ® := dg(1) : g > g. Then 


(exp(&)) = exp(@é) (7.4.11) 


for all € € g by Corollary 5.3.3 and Example 5.2.18. Moreover, @ is an orthogonal 
transformation of g that preserves the Riemann curvature tensor (Theorem 5.3.1). 
Thus |[®&, @y]| = |[E, n]| for all &,7 € g by Example 5.2.18. So, if & € Z(g), 
then [€, 7] = 0 for all n, hence [®&, ny] = 0 for all n, and hence PE € Z(g). This 
shows that 


PZ(g) = Z(Q). (7.4.12) 


By (7.4.11) and (7.4.12) we have & € A if and only if ¢(exp(€)) = 1 if and only 
if exp(@£) = 1 if and only if SE € A, so that SA = A. Since ¢ is isotopic to 
the identity through isometries, by assumption, there exists a smooth path of or- 
thogonal transformations ®, of g from %) = 1 to ; = @ that satisfy ®, A = A 
for all t. Thus ®€ = & for all & € A and so for all € € Z(q) because the lattice 
spans the subspace Z(q) by (d). Hence, by (7.4.11) and (c), we obtain (A) = h for 
all h € Z(G) whenever (1) = 1. 

To prove equation (7.4.10) in general, fix an element g € Gand define the diffeo- 
morphism ¢, : G > G by ¢,(h) := $(g)~'¢(gh) for h € G. Then ¢, is an isome- 
try and ¢,(1) = 1. Moreover, , € Jp(G), because G is connected. Hence $,(h) = h 
for all h € Zo(G) and this proves Lemma 7.4.5. O 


Proof of Theorem 7.4.1 Let @ € Io(G). By Lemma 7.4.5, @ descends to a diffeo- 
morphism ¢ : G + Gof the quotient group 


G := G/Z,(G). 
The Lie algebra of G is the quotient space 


o= 6/20 = Za 


and the invariant inner product on g restricts to an invariant inner product on the or- 
thogonal complement of Z(q). This defines a bi-invariant Riemannian metric on G. 
The diffeomorphism ¢ is an isometry with respect to this metric, because the deriva- 
tive db(g): T,G — Tg g)G is an orthogonal transformation, which by (7.4.10) sends 
a tangent vector gn € T,G with n € Z(qg) todd(g)gn=(g)n € Tg(g)G, and hence 
it sends the subspace gZ(q)+ C T,G to ¢(g)Z(g)+ C Tycg)G. Apply Lemma 7.4.4 
to the isometry to obtain elements a,b € G whose equivalence classes a, beG 
satisfy (2) = a@gb~! for all g € G. This implies 


a(g) := a~'(g)bg"! € Z(G) (7.4.13) 
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for all g € G. Moreover, it follows from Lemma 7.4.5 that 


a(gh) = a(g) (7.4.14) 
for all g € Gand all h € Zo(q). Now define the isometry w : G > G by 
W(g) = a"'$(g)b = a(g)g (7.4.15) 
for g € G. Fix an element g € G and define the linear maps A, W: g > g by 
Ag := a(gy'da(g)gé, WE = W(g)'dw(g)gé (7.4.16) 


for eg. Then W=id+A by (7.4.15) and the map A vanishes on Z(g) 
by (7.4.14) and takes values in Z(q) by (7.4.13). Since W is an orthogonal transfor- 
mation, this implies 


W = id, A=0. 


Hence it follows from the definition of the linear map A in (7.4.16) that the 
map a : G > Z(G) in (7.4.13) is constant. Thus 


$(g) = a(l)agh" 


for all g € G, and this proves Theorem 7.4.1. O 


Corollary 7.4.6 Let G be a compact connected Lie group equipped with a bi- 
invariant Riemannian metric. Then the map (a,b) > ¢q,5 in Theorem 7.4.1 de- 
scends to a Lie Group isomorphism 


pa: (GX G)/Z(G) > HC). (7.4.17) 


Proof The map G x G — 4)(G) : (a,b) +> $4, is a group homomorphism by def- 
inition, is smooth by Theorem 7.3.1, and is surjective by Theorem 7.4.1. Moreover, 
gap is the identity if and only if a = b € Z(G). Hence the map pg in (7.4.17) is 
a Lie group isomorphism, with the smooth structure on the quotient group (G x 
G)/Z(G) determined by Theorem 2.9.14. This proves Corollary 7.4.6. Oo 


Example 7.4.7 Theorem 7.4.1 establishes a one-to-one correspondence between 
isometries @ € Jo(G) that satisfy (1) = 1 (the case a = b) and Lie group auto- 
morphisms of G in the identity component. This correspondence does not extend to 
other connected components. 

For example, in the case G = T” = R"/Z" the group of automorphisms of T” 
is the infinite group Aut(T'”) = GL(n, Z) of integer matrices with determinant +1, 
while the group of isometries that fix the origin is the finite subgroup O(n, Z) C 
GL(n, Z) of orthogonal integer matrices. Also, if G is not abelian, then the isom- 
etry G> G: gt g’! is not a Lie group automorphism, but a Lie group anti- 
automorphism. 
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Exercise 7.4.8 Examine the case G = SU(2) (Example 2.5.21) and deduce that 
there exists a Lie group isomorphism 


SO(4) = (SU(2) x SU(2))/{+1}. 


Exercise 7.4.9 Let G be a compact Lie group equipped with a bi-invariant Rie- 
mannian metric and let g := Lie(G). Show that the formula 


e358) |= (8¢' — 86, [6,8’] + zad([C, ‘‘))) (7.4.18) 


for ¢,¢’ € g and 6,6’ € Der(g) defines a Lie bracket on g x Der(g). Show that 
the map Vectx(G) > g x Der(g) : X & (X(1), VX(1)) identifies the Lie al- 
gebra Vectx(G) = Lie(J(G)) of Killing vector fields on G with the Lie alge- 
bra g x Der(q). Show that the homomorphism (7.4.17) induces the surjective Lie 
algebra homomorphism 


gx gq gx Der(g) : En) + (€—7, sad +n), (7.4.19) 


whose kernel consist of all pairs (€, 7) € g x g that satisfy § = n € Z(q). Show 
that dim(Der(q)) = dim(g) — dim(Z(q)). 
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The last two sections of this book are devoted to Donaldson’s beautiful paper [17] 
in which he develops a differential geometric approach to Lie algebra theory. The 
results of [17] will be explained in reverse order. The first subsection examines the 
sphere at infinity of a Hadamard manifold M and contains a proof of [17, Theo- 
rem 4], which asserts that every convex function f : M — R that is invariant under 
the action of a Lie group G by isometries must attain its minimum whenever the 
G-action has no fixed point at infinity (Sect. 7.5.1). The next subsection deals with 
the special case of [17, Theorem 3], where M is the manifold of inner products on a 
vector space V on which a Lie group G C SL(V) acts irreducibly (Sect. 7.5.2). If G 
is the identity component of the isotropy subgroup of a nonzero vector w € W un- 
der a representation of the special linear group p : SL(V) — SL(W), then by [17, 
Theorem 2] there exists an inner product on V for which the Lie algebra q of G is 
symmetric (Sect. 7.5.3). This is used in [17, Theorem 1] in the case where V = g 
is a simple Lie algebra, w is the Lie bracket, and G is the identity component of the 
group of automorphisms of g, to establish the existence of symmetric inner products 
on g and deduce various standard results in Lie algebra theory. These applications 
to Lie algebra theory are deferred to the next section. 
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7.5.1 Convex Functions and The Sphere at Infinity 
Assume throughout that M is a Hadamard manifold, i.e. a nonempty, connected, 
simply connected, complete Riemannian manifold with nonpositive sectional curva- 
ture (Definition 6.5.1). For p € M denote the unit sphere in the tangent space 7, M 
by 

Sp := {v € TM | |v| = 1}. 


Their union determines a submanifold SM := {( P.v)|pEeM,ve ant of the tan- 
gent bundle, called the unit sphere bundle. 


Definition 7.5.1 Define an equivalence relation ~ on SM by 
def 
(p,v) ~ (g,w) — sup d(exp, (tv), exp, (tw)) < oo (7.5.1) 
t>0 


for (p, v), (g, w) € SM. The equivalence class of a pair (p, v) € SM will be de- 
noted by [p, v] := {(¢, w) € SM | (q, w) ~ (p, v)} and the quotient space 


Soo(M) := SM/~ = {[p,v]| (p.v) € SM} 
is called the sphere at infinity of M. 
The following lemma shows that the map S, > S.o(M): vu +> [p, v] is a home- 
omorphism with respect to the quotient topology on S,.(M) for every p ¢ M 


(see [17, Lemma 3]). 


Lemma 7.5.2. There exists a unique collection of maps Fy,» : Sy — Sq, one for 
each pair of points p,q € M, such that 


-1 
Fy o(v) = Ym SPE OPER) 


, (73.2) 
Roe |expz! (exp, (Rv))| 


for all p,q € M and all v € Sy. Moreover, the convergence in (7.5.2) is uniform 
on Sy, the maps Fy.) are homeomorphisms, and they satisfy 


w = Fy p(v) cs sup d(exp, (tv), exp,(tw)) < co (7.5.3) 
t>0 


for all (p, v), (q,w) € SM and 
Fg ° Fay = Fry, Fp = id (7.5.4) 


forall p,q,r €¢ M. 
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Fig. 7.1 The sphere at infin- 
ity 


expg(tw(R)/R) expp(tv) 


p 
q 
Proof Let p,q € M and define the maps Fry.) : S, > Sy by 
1 
exp, (exp,(Rv)) 
Frg.p(v) = ——* (7.5.5) 
lexp;'(exp,(Rv))| 
for R > Oand v € S,. We claim that, for all R > d(p,q) and all v € S,, 
d d(p.q) 
—F < ——____——__., 7.5.6 
ae teeo)| < gaat ee 


To see this, fix an element v € S, and define the geodesic y : R — M by 
y(t) := exp,(tv). Define the curve w : R > 7,M by w(t) := exp, (y(t) for 
t € R. Then Fryp(v) = w(R)/|w(R)| and hence 


0 
= Frgp(v) = 


w(R) w(R) w(R) \ w(R) 
iy ( 


[w(R)|  \|w(R)]? |w(R)|/ wR)" 


This is the orthogonal projection of the vector w(R)/|w(R)| onto the orthogonal 
complement of w(R). Hence its length connot decrease by adding to it a scalar 
multiple of w(R), and so 


(RK) _w(R) | _ [ro(R) = w(R)/RI 
wR) Riw(R)| jw(R)| 


57) 


a] 
Fe Fea 0) 


= | 


Next we use the expanding property of the exponential map in Theorem 6.5.2 twice, 
namely first the inequality for the derivative in part (ii) at the point g and then 
the inequality in part (ii1) at the point y(R) (see Fig. 7.1). Define the tangent vec- 
tors Up, Ug € Tyr) M by 


Up = —y(R) = -d exp,(w(K))w(R), 


d 
Ug i= a7 exp, (tw(R)/R) =-d exp, (w(R))w(R)/R. 
t=R 
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Then Theorem 6.5.2 yields the estimate 
|w(R) — w(R)/R| < |d exp, (w(R))(W(R) — w(R)/R)| = |vp — val. (7.5.8) 


Also, exp, ay(SUp) = exp,((R — 5)v), exp,;py(S¥q) = expy((R — s)w(R)/R). 
Take s = R—t and use Lemma 6.5.5 to obtain, for 0 < t < R, 


pce ee ae. M (7.59) 


= Hae = 

Since d(p, y(R)) = R and d(q, y(R)) = |w(R)|, the triangle inequality yields 
R—d(p,q) < |w(R)| < R+ d(p,q). (7.5.10) 

Combinig the inequalities (7.5.7), (7.5.8), (7.5.9), and (7.5.10) we find that 


jto(R) = w(R)/RI Ir — vl. dp.) 
wR ~ [wR] > R(R=d(p.)) 


7 


0 
phe) 


This proves the estimate (7.5.6). 

It follows from (7.5.6) by integrating from a fixed number R > d(p, q) to infin- 
ity that the maps Fr, : S, > Sy converge toamap Fy, : S, — S, as R tends to 
infinity and that 


f  d(y.a) R 
sup | Fy,p(v) — Frq.p(v) =f EP ar =e —— 
ae ae J r(r—d(p.q)) R—-d(p.q) 
for all R > d(p,q). Thus the convergence is uniform and hence the limit map F;,_, 
is continuous. 
Next we prove (7.5.3). Let p,g €¢ M and v € S,. Then 


= ar — i WAR _ yw) 
wis Fap(0) = fim Frao() = fim Toca = AR 


Here the third equality follows from (7.5.5) and the definition of the vector 
w(R) = exp, '(exp,(Rv)), and the last equality follows from (7.5.10). Hence 
exp, (tw) = limr+ooexp,(tw(R)/R), and so it follows from (7.5.9) by tak- 
ing the limit R — oo that d(exp,(tv),exp,(¢w)) < d(p,q) for all ¢ = 0. This 
proves “ = > ” in (7.5.3). The converse implication follows from the fact that, 
by Theorem 6.5.2, there can be at most one tangent vector w € Sj, satisfy- 
ing sup,.o d(exp,(tv), exp,(tw)) < oo. Thus we have proved that the maps 
Fy,p 2 Sp > Sq in (7.5.2) satisfy (7.5.3). That they also satisfy (7.5.4) follows 
directly from (7.5.3) and the fact that (7.5.1) defines an equivalence relation 
on SM. Hence each map Fy,,:S,— Sq is a homeomorphism with the in- 
verse F,, : Sg — Sp, and this proves Lemma 7.5.2. Oo 
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Lemma 7.5.3 [fd €1(M), p,q € M, and v € Sj, then 


F(q).6(p) ° 40 (Pp) = 46(q) © Fa.p : Sp > Seq): (7.5.11) 


Thus the group of isometries of M acts continuously on the sphere at infinity via 
T(M) x Soo(M) — Soo(M) : (6, [p. v]) > bs[p, v] := [6(p), db(p)v]. 


Proof Since ¢ is an isometry it satisfies $ o exp, = expyi) 0 df (q) for all gq ¢ M 
by Corollary 5.3.3. Hence 


do(q)(exp;' (exp, (Rv) ) = XP py) (#(exP,(R)) ) 
= EXP py (EXP yc) (R d(p)v)) 


and so db(q)° Fra» = Froq)o(p) 0 4¢(p) for all p,g ¢ M and all R> 0. 
Divide by the norm and take the limit R — oo to obtain (7.5.11). This proves 
Lemma 7.5.3. Oo 


Definition 7.5.4 Let G C GL(n, R) be a Lie group. A G-action on M by isome- 
tries is a Lie group homomorphism 


G—>1(M): gt x, 


Le. the map Gx M > M :(g, p)+> $,(p) is a smooth group action (Defini- 
tion 2.5.40) and the map ¢, : M — M is an isometry for each g € G. In this 
situation we say that the G action has a fixed point iff there exists an element p € M 
such that 


og(p) = P 


for all g € G. We say that the G-action has a fixed point at infinity iff the induced 
G-action on the sphere at infinity has a fixed point, i.e. there exist elements p ¢ M 
and v € S, such that 


Ape(p)v = Fo,(p),p&) 


for all g € G. If such a pair (p, v) € S(M) does not exist, we say that the G-action 
has no fixed point at infinity. 


Definition 7.5.5 A smooth function f : M — R is called convex iff the function 
f cy :R — Ris convex for every geodesic y : R > M. 


With these preparations in place we are ready to state the following existence 
theorem for critical points of a convex function (see [17, Theorem 4]). 
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Theorem 7.5.6 (Donaldson) Let M be a Hadamard manifold equipped with a 
smooth action G > 1(M): g +> ¢¢ of a Lie group G by isometries, let K be a 
compact subgroup of G, and let f : M — R be a convex function such that 


fobde=f 


for all g € G. Assume that the G-action has no fixed point at infinity. Then there 
exists an element py © M such that 


f (Po) = a f(P), du(po) = po forallu eK. (7.5.12) 


As pointed out in [17], similar results can be found in the works of Bishop— 
O’ Neill [7] and Bridson—Haefliger [11, Lemma 8.26]. We remark that the compact- 
ness of the subgroup K C G is only needed for an appeal to Cartan’s Fixed Point 
Theorem 6.5.6. If we assume instead that the action of K on M has a fixed point, 
compactness is not required. The proof of Theorem 7.5.6 is based in the following 
lemma (see [17, Lemma 5]). 


Lemma 7.5.7 Let p; be a sequence in M such that lim;_,.. d(p, pi) = & for 
some (and hence every) p € M. Let I C 1(M) be a collection of isometries of M 
such that sup; d(p;,¢(p;)) < 00 for all ¢ € I. Then the isometries in I have a 
common fixed point at infinity, i.e. there exists an element [p,v] € Sx(M) such 


that |p, v] = [p, v] for all @ € 1. 
Proof Fix anelement p € M and define 


—1 
exp, (Pi) _ 

vi = —B—, Ri = lexpy'(pi)| = d(, Pi). (7.5.13) 
for eachi € N such that p; 4 p. Passing to a subsequence, if necessary, we may as- 
sume that p; # p foralli € N and that the limit v :=lim;_,.9 v; € S, exists. We will 
prove that dé(p)v = Fy p),p(v) for all ¢ € 7. To see this, let @ € J, choose c > 0 
such that d(p;, $(pi)) < c for all i, and define 


exp, '(~(pi)) 


v= —> BR = lexp,'(G(2i))| = d(P. $(2i)).- 


Since exp, (Rj v;) = @(pi) and exp,(Riv;) = pi, Theorem 6.5.2 asserts that 
|Rivi — Riv; | < d(pi, o(pi)) < 
for all 7. Since |R; — R‘| < c by the triangle inequality, it follows that 


2c 
lae=u,|S 
R; 
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and hence 


lim R; = 0, lim v} = lim v; = v. 


ioo i>oo i—oo 


Since exp, (R}v;) = $(pi), this implies 


(v) = tim exp, ' (exp, (Ri v;)) - exp, |($(pi)) 
Fa.p i>oo |exp, (exp, (R; a) ee i>oo |expz!($(pi))| 


for all g € M. Take gq = $(p) and use the identity EXP5(p) og =dd(p)o exp,” 
and equation (7.5.13) to obtain 


do(p)exp,'(pi) 


Fg(p).p) = = do(p) lim v; = dd(p)v. 
HO) ON db PIED; (PDI ines 
Hence ¢,[p, v] = [p, v] for all ¢ € 7 and this proves Lemma 7.5.7. oO 


The next step in the proof of Theorem 7.5.6 is to examine the gradient flow of 
the convex function f : M — R. The flow equation has the form 


ye) = -VF(y), (7.5.14) 


where the gradient vector field is defined by (V f(p),v) := df(p)v for pe M 
and v € 7,M. An important consequence of convexity is that the distance between 
any two solutions of the gradient flow equation is nonincreasing. This is the content 
of the next lemma (see [17, Lemma 6]). 


Lemma 7.5.8 Let f : M — R bea smooth function. Then f is convex if and only 
if it satisfies the condition 


(VoV f(p),v) 2 0 (7545) 
forall p € M andallv € T,M. If f is convex, then the following holds. 


(i) Equation (7.5.14) has a solution y : [0,o0) — M on the entire positive real 
axis for every initial condition y(O) = Po. 

(ii) Let yp : [0,00) > M and y; : [0, co) > M be two solutions of (7.5.14). Then 
the function [0,00) > R:t + d(t) := d(yo(t), y1 (t)) is nonincreasing. 


Proof Let y:R — M bea geodesic. Then f oy : R > R is convex if and only 
if 


= < Sry =4 “(9 f(y). ¥@)) = (ViqV LY). 70) 
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for all t. This holds for all geodesics if and only if f satisfies (7.5.15). In the re- 
mainder of the proof we assume that f is convex. 
Let po € M and for T > 0 define 


Kr :={p€M|d(po, p) < T|V f(po)|}- 


This set is compact because M is complete. Now let y : [0, 7) — M be the solution 
of (7.5.14) with y(0) = po on some time interval [0, 7). Then 


d 
aVEM? = AV,V LY), VIO) = -AV/V IY), 7) < 0 


by (7.5.15). Thus the function t + |V f(y(t))| = |y(4)| is nonincreasing and 
so y(t) € Kr forO <t < T. Since Kr is a compact subset of M, the solution y 
extends to a longer time interval [0,7 + 6) for some 6 > 0 by Corollary 2.4.15. 
Since T > 0 was chosen arbitrary, this proves (i). 

We prove part (ii). Assume without loss of generality that yo(0) 4 yi (0) and so 
yo(t) # y(t) for all ¢. For ¢ > 0 let [0, 1] ~ M : st} y(s,t) be the unique geodesic 
that satisfies y(0,t) = yo(t) and y(1,t) = y(t). Then d(t) = d(yo(t), yi (t)) = 
L(y(..t)) = |dsy(s, t)| for all s,t and hence 


1 
i= | (dsv(s,t), Vidsy(s, t)) 


1 
1 0 
1 
dy) 


((s7C1.2), VF(A.0)) = (8570.2), ¥F7O.0))) 
i 


1 
= 0) [o6.0. Vas ys.) F(y(S,t))) ds < 0 


0 


by (7.5.15). This proves (ii) and Lemma 7.5.8. Oo 


Lemma 7.5.9 Let f : M — R be a convex function that has a critical point Doo. 
Then f(p) = f(Pxo) =: ¢ forall p € M, and the set Cp := f~'(c) of minima of f 


is geodesically convex. 


Proof Let p € M and let y : [0,1] ~ M be the unique geodesic with the end- 
points y(0) = px and y(1) = p. Then B := f oy: [0,1] > R is a convex func- 
tion satisfying B(O) = f(pxo) =c and B(0) = 0, hence A(t) > c for all ¢, and 
so f(p) = BC) = c. Thus f attains its minimum at po. 

Now let po, pi € Cy and let y : [0, 1] > M be the unique geodesic with the end- 
points y(0) = po and y(1) = p,. Then the function B := f oy: [0,1] > Ris con- 
vex, satisfies B(0) = 6(1) = c, and takes values in the interval [c, 00). Hence B =c 
and so y(t) € Cy for all t. This proves Lemma 7.5.9. 
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Proof of Theorem 7.5.6 Choose an element po € M and let y : [0,00) > M be 
the unique solution of equation (7.5.14) that satisfies the initial condition y(0) = po. 
Assume first that 


up d(po, y(t)) = 00 (7.5.16) 


and choose a sequence ¢; — oo such that limj_,.. d(po, y(t;)) = co. Now let 
g €G. Since ¢, : M — M is an isometry and f o @ = f, it follows that 
db.(p)V f(p) = Vf (og(p)) for all p ¢ M (Exercise: Prove this.) Hence the 
curve [0,00) > M:tt> $,(y(t)) is another solution of equation (7.5.14) and 
hence d(y(t),bg(v(ti))) < d(po.¢g(po)) for all 7 and all g € G by part (ii) of 
Lemma 7.5.8. Hence Lemma 7.5.7 asserts that there exists a (p, v) € SM such that 
do,(p)v = F%,(p),p(v) for all g € G, in contracdiction to our assumption that the 
G-action has no fixed point at infinity. 
This shows that our assumption (7.5.16) must have been wrong. Thus 


sup d(po, y(t)) =: R < ov, 
t>0 


and so our solution y : [0,co) > M of (7.5.14) takes values in the compact set 
B:={p€M|d(po, p) < R}. Since the function t + f(y(t)) is nonincreasing, 
this implies that the limit 


c:= lim f(y(t)) => min f(p) (7.5.17) 
too peB 
exists and is a real number (and not —oo). Since £ f(y(t)) =-—|V f(y(t))|? and the 


function t + f(y(t)) is bounded below by c, there must exist a sequence t; > oo 
such that 


lim V f(y(i)) = 0. (7.5.18) 
Since y(t;) € B for all i, we may also assume that the limit 
Poo += lim y(t) (7.5.19) 


exists (after passing to a subsequence, if necessary). This limit is a critical point 
of f by (7.5.18), and f(po) = c by (7.5.17). Hence, by Lemma 7.5.9, f attains its 
minimum at poo and the set Cr := {p € M | f(p) = c} of minima of f is geodesi- 
cally convex. We must find an element of Cy that is fixed under the action of K. By 
Cartan’s Fixed Point Theorem 6.5.6 there exists a g € M such that ¢,(q¢) = q for 
all u € K. Since M is complete and Cy is a nonempty closed subset of M, there 
exists an element po € Cy such that 


d(q, Po) = inf d(q, p) =: 6. (7.5.20) 
peCy 
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We claim that ¢,(po) = po for all wu © K. To see this, fix an element u € K, 
let y : [0,1] ~ M be the geodesic joining y(0) = po to y(1) = ¢, (po), and denote 
by m := y(1/2) the midpoint of this geodesic. Then Lemma 6.5.7 asserts that 


2d(q,m) + < d(q, po)” + Aq, bu(po))’. (7.5.21) 


d(po, Pu (po))* 
2 

Since ¢,,(q) = q and @,, is an isometry, we have d(q, ¢,(Po)) = d(q, po) = 4, and 

since Cy is geodesically convex, we have m € Cy and so d(q,m) => 6. Hence it 

follows from (7.5.21) that po = ¢,(po). This shows that po € Cy is a fixed point 

for the action of K on M and proves Theoren 7.5.6. O 


Example 7.5.10 To illustrate the argument in the proof of Theorem 7.5.6, 
take M = C with the standard flat metric. Then the equivalence relation on the 
sphere bundle SM = C x S'! is given by translation in C and so the sphere at 
infinity is S,,(M) = S!. The orthogonal group G = O(2) acts by isometries on M 
and has no fixed point at infinity. The subgroup K = Z/2Z acts by complex con- 
jugation and its fixed point set is the real axis. Choose a smooth convex function 
h:R— R that vanishes on the interval [—1, 1] and is positive elsewhere. Then 
the function f(z) := A(|z|) is convex and G-invariant and the set Cy of minima 
of f is the closed unit disc in C. If g = 2 is the fixed point of the K-action chosen 
in the proof, then pp = 1 € Cy. If G=K = Z/2Z, then +1 are the fixed points 
at infinity and f(z) := e®°©) is convex and G-invariant, but does not take on its 
infimum. 


Example 7.5.11 ({17]) Consider the case where M = D” is the Poincaré model 
of hyperbolic space (Exercise 6.4.22). Then the sphere at infinity is the bound- 
ary OD” = S’~! (Exercise 6.4.24). If the convex function f extends continuously 
to the closed ball and does not take on its minimum in M, then it attains its mini- 
mum at a unique point on the boundary, because any two boundary points are the 
asymptotic limits of a geodesic in M. Hence the minimum on the boundary is fixed 
under the action of any Lie group on M by isometries, that leave f invariant. This 
is reminiscent of the Kempf Uniqueness Theorem in GIT (see [37] and [20, Theo- 
rem 10.2]), where M = G/K is associated to the complexification G of a compact 
Lie group K and f is the Kempf—Ness function (see [38, 53] and [20, §4]). 


7.5.2 Inner Products and Weighted Flags 


We will now turn to a specific example, where the Hadamard manifold is the space 
of positive definite symmetric matrices with determinant one (Sect. 6.5.3). Fol- 
lowing [17], we choose a finite-dimensional real vector space V equipped with a 
fixed inner product (-,-). Then every inner product on V has the form (v, v’) p := 
(v, P~!v’) for some self-adjoint positive definite automorphism P. Denote the set 
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of such automorphisms with determinant one by 
Po(V) = {P e€ End(V) | P* = P > 0, det(P) = 1}. (7.5.22) 


Here P* € End(V) is defined by (v, P*v’) := (Pv,v’) for v,v’ € V, and the nota- 
tion“P > 0” means (v, Pv) > Oforallv € V \ {0}. Thus Po(V) is a codimension- 
1 submanifold of the space of self-adjoint endomorphisms of V. Its tangent space 
at P € Po(V) is given by 

TpPo(V) := {P € End(V)| P = P’, trace(P P~') = 0}. (7.5.23) 


The Riemannian metric on Po(V) is defined by 


|Plp := trace(P P-'P P-*) (7.5.24) 


for P € Po(V) and Pe Tp Po(V) as in Sect. 6.5.3, and so Po(V) is a eae 
manifold by Theorem 6.5.10. For Pe Tp vat) the endomorphism P P~' is self- 
adjoint with respect to the inner product (-, P~!-) on V. 

The group SL(V) C GL(V) of automorphisms of V with determinant one acts 
on Po(V) by the isometries ¢,(P) := gPg* for g € SL(V). The isotropy subgroup 
of 1 € Po(V) is the special orthogonal group SO(V). The action of a subgroup 
G Cc SL(V) on V is called irreducible iff there does not exist a linear subspace 
E CV, other than E = {0} and E = V, such that gE = E for all g € G. This 
notion can be used to carry over the general existence theorem in Sect. 7.5.1 for 
critical points of a convex function to the present setting (see [17, Theorem 3]). 


Theorem 7.5.12 (Donaldson) Let G C SL(V) be a Lie subgroup such that the 
action of G on V is irreducible. Let K C G be a compact subgroup and let f : 
Po(V) = R be a convex function such that f(gP g*) = f(P) forall g € Gand all 
P €P(V). Then there exists a Py € Po(V) such that 


f(Po) =. inf f(P), uPou* = Po forallu € K. (7.5.25) 
PePo(V) 
The goal will be to deduce Theorem 7.5.12 from Theorem 7.5.6. Thus we must 


understand the sphere at infinity of the space Po(V). This will be accomplished 
with the help of the following definition. 


Definition 7.5.13 A weighted flag in V is a pair (F, 2), where F is a finite se- 
quence of linear suspaces 


l= MCHrChHhCc-:::Cr=V 


such thatn; := dim(F;)/ dim(F;_;) > Ofori = 1,...,7r, and pis a finite sequence 
of real numbers 4; > [2 > ++: > jy Satisfying the conditions 


eo = 0, Se = 1. (7.5.26) 
i=1 i=1 
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Let F = F(V) be the set of weighted flags. The group SL(V) acts on F(V) by 
g- (Fh) := (¢Fi, mi); for (F, w) = (Fi, mii € F(V) and g € SL(V). 


For P € Po(V) the unit sphere in the tangent space Tp Po(V) is the set 
Sp:= {P =P €End(V) | trace(P P~') = 0, trace(P P-'P P~') =}. 


Let P € Po(V) and P € 8p. Then the endomorphism PP is self-adjoint 
with respect to the inner product (-,P~!-) and hence has only real eigenval- 
ues [41 > [lz >++: > fy. For each i let E; C V be the eigenspace for jz; and 
define n; := dim(E;). Since trace(P P~') =0 and trace(P P-' P P-') =1, 
the j4;,n; satisfy (7.5.26). The weighted flag of (P, P) is defined by 


tp(P) := (F, w) = (Fi mi), «+ Fs Br) © FV), (7.5.27) 


where F; := E; ® E, ®---@ E; fori = 1,...,r. For each P € Po(V) the map 
lp :Sp — F(V) defined by (7.5.27) is bijective, and thus induces a (compact, metriz- 
able) topology on F(V). This topology is independent of P as the next lemma 
shows. The lemma also shows that the space of weighted flags is the sphere at in- 
finity (see [17, Lemma 4]). 


Lemma 7.5.14 The equivalence relation in Definition 7.5.1 on the unit sphere 
bundle S‘Po(V) is given by 


(P,P)~(0,0) <= — (P) = 19 (0) (7.5.28) 
for P,O EPo(V), Pe Sp, and O € 8g. Thus the map F9,p : 8p — SQ as defined 
in Lemma 7.5.2 is given by Fg,p = tg tp'. Moreover, the map SPo(V) > F(V) : 
(P, P) b> t»(P) is SL(V )-equivariant, i.e. 

lepet (GP g*) = g-tp(P) (7.5.29) 
forall P € Po(V), all Pe Sp, andall g € SL(V). 
Proof Since (gPg*)(gPg*)! = g(PP~!)g~! for all P € Sp and g € SL(V), 
equation (7.5.29) follows directly from the definitions. The proof that the equiva- 


lence relation satisfies (7.5.28) rests on the following claims. 


Claim 1 Let S € 84, let (Fj, wi)i_, = 41 (S) be the flag associated to S, and 
leth € SL(V) be an automorphism of V with determinant one such that 


hF; = F; fori =1,...,7r. (7.5.30) 


Then (1, S) ~ (hh* ,ASh*). 
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Claim 2 Let g € SL(V) and fix any flag {0} = Fo S Fi S-:+ S Fy = V. Then 
there exist elements h € SL(V) and u € SO(V) such that h satisfies (7.5.30) 
and g = hu. 


We prove, that these two claims imply (7.5.28). Fix any element S € Sj, let 
(Fi, “i);_, = 1(S) be the weighted flag of S, and let P € Po(V). Choose an 
element g € SL(V) such that gg* = P, and choose u and fh as in Claim 2. 
Then hh* = P, and the pairs (P, P) := (hh*,hSh*) and (1,5) have the same 
flag by (7.5.30) and are equivalent by Claim 1. Hence any pair (P, P) is equivalent 
to (1, S) if and only if it has the same flag. By transitivity of the equivalence relation 
we deduce that (7.5.28) holds for all P, 0 € Po(V). 

We prove Claim 2. Let F/ := eg! F; and m; := dim(F;) fori = 1,...,r. Then 


choose orthonormal bases e1,...,@,, and el: ee sy Op of V such that for, each 7, 
the vectors ¢),...,@»,; form a basis of F; and the vectors e},..., Co form a basis 
of F/. Define the orthogonal transformation u by we) := e; fori = 1,...,m. It 


satisfies F/ = u—! F, and hence gu! F; = F, for all i. Thus h := gu7'! satisfies 
the requirements of Claim 2. 

We prove Claim 1, following [17, Lemma 4]. Define the geodesics yo, y; 
in Po(V) by yo(t) = exp(tS) and y(t) = hexp(tS)h* (see Lemma 6.5.18). By 
equation (6.5.12) the square of their distance is given by 


p(t) := d(exp(tS), hexp(tS)h*)? = trace( (log(M(1)M(1)*))’), 
M(t) := exp(—tS/2)h exp(tS/2). 


(7.5.31) 


In the eigenspace decomposition V = EF, ®--- @ E, of the self-adjoint endomor- 
phism S the automorphisms / and exp(tS/2) have the form 


Ay Ay ss Ay 
pa) 9 An 
hieig (7.5.32) 
O ++ O hy 
exp(tS/2) = diag(e"/7 1p, ; e221, ee eft T ey). 
Here the upper triangular form of 4 follows from (7.5.30). Hence 
Ay Ayt) +--+ A(t) 
Mea \" we a (7.5.33) 
: hy-tr (t) 
0 tee 0 Ney 


where /ij; (t) := eHi-Ki)/2 fy, forl <i <j <r. Since p; > p; fori < j, it fol- 
lows that the limit M., := lim;... M(t) = diag(/1,...,/,;,) exists and is an in- 
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vertible endomorphism of V. Hence the function p: [0,c0o) > R in (7.5.31) is 
bounded. This proves Claim | and Lemma 7.5.14. Oo 


Proof of Theorem 7.5.12. Let GC SL(V) be a Lie subgroup which acts irre- 
ducibly on V. Then G acts on Po(V) by isometries. By Lemma 7.5.14 the induced 
action on the sphere at infinity S.(Po(V)) = F(V) is given by G x F(V) > 
FV) : (g, Fi, Midi=t....r) te (9 Fi, Mi)i=t....r- This action has no fixed points 
because the action of G on V is irreducible and r > 2 for each weighted flag 
(F;, i)i=t....r € FV). Hence all the assertions of Theorem 7.5.12 follow directly 
from Theorem 7.5.6 with M = Po(V). Oo 


7.5.3 Lengths of Vectors 


The material in this section goes back to ideas in geometric invariant theory de- 
veloped by Kempf—Ness [38], Ness [53], and Kirwan [40] in the complex setting 
and by Richardson—Slodowy [59] and Marian [48] in the real setting. We assume 
throughout that V, W are finite-dimensional real vector spaces and p : SL(V) > 
SL(W) is a Lie group homomorphism. Note that every Lie group homomorphism 
from GL(V) to GL(W) restricts to a Lie group homomorphism from SL(V) to 
SL(W), because every Lie group homomorphism from GL(V ) to the multiplicative 
group of nonzero real numbers is some power of the determinant. 

Fix a nonzero vector w € W and denote by G,, C SL(V) the connected compo- 
nent of the identity in the isotropy subgroup of w, i.e. 


4a smooth path y : [0,1] ~ SL(V) 
Gy := 4g € SL(V) | such that y(0) = 1, y(1) = g, and >. (7.5.34) 
p(y(t))w = w for0O <t <1 


By Theorem 2.5.27 this is a Lie subgroup of SL(V) with the Lie algebra 


Gw = {E € sl(V) | A(E)w = 0}. 


There are many examples of this setup that are related to interesting questions in 
geometry. The vector space W can be the space of all symmetric bilinear forms on V 
and w can be an inner product, in which case G,, is the special orthogonal group 
associated to the inner product, or w can be the quadratic form (6.4.9), in which 
case G,, is the identity component of the isometry group of hyperbolic space. Or W 
can be the space of skew-symmetric bilinear forms on V and w a symplectic form, 
in which case G,, is the symplectic linear group. Or W can be the space of skew- 
symmetric bilinear maps on V with values in V. Then w can be a cross product in 
dimension three or seven, or w can be the Lie bracket of a Lie algebra gq = V and 
then G,, is the identity component in the group of automorphisms of g. The latter 
example will be examined in detail in Sect. 7.6.2. Of particular interest are the cases 
where the group G,, is noncompact. 
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Definition 7.5.15 An inner product (-,-) on V is called (p, w)-symmetric iff 
the Lie subalgebra g,, C sI(V) is invariant under the involution A +> A*, defined 
by (v, A*v’) := (Av, v’) for v,v' € V. 


Exercise 7.5.16 Let (-,-) be a (p, w)-symmetric inner product on V. Prove that g € 
G,, implies g* € G,,. Hint: Choose a smooth path g : [0, 1] > G, with the end- 
points g(0) = land g(1) = g, and define &(t) := g(t)! g(t). Show that the initial 
value problem h(t) = &(t)*h(t), h(O) = 1, has a unique solution / : [0, 1] > Gy, 
(Exercise 2.5.36) and that h(t) = g(t)* for all ¢. 


The following theorem asserts the existence of a (p, w)-symmetric inner product 
on V under an irreducibility assumption (see [17, Theorem 2]). 


Theorem 7.5.17 Assume that the group Gy in (7.5.34) acts irreducibly on V. Then 
there exists a (p, w)-symmetric inner product on V with the following properties. 
The subgroup 


K, := Gy NSO(V) 
is connected and is a maximal compact subgroup of G,,. Moreover, every com- 
pact subgroup of Gy is conjugate in G,, to a Lie subgroup of Ky. Thus, if K 
is any maximal compact subgroup of Gy, there exists an element h € Gy such 


that K = hK,,h7'. 


Proof See Lemma 7.5.23. O 


Example 7.5.18 The hypothesis that the group G,, acts irreducibly on V cannot 
be removed in Theorem 7.5.17. Consider the case where W = V has dimension 
at least two, the homomorphism p : SL(V) — SL(V)) is the identity, and w € V is 
any nonzero vector. Then the one-dimensional linear subspace Rw C V is evidently 
invariant under the action of G,,, and there does not exist any (p, w)-symmetric 
inner product on V. 


To begin with, we will fix any inner product (-,-)y on V and define the 
space Po(V) of self-adjoint positive definite automorphisms of V with determi- 
nant one in terms of this fixed inner product. We will then use Theorem 7.5.12 to 
find an element P € Po(V) such that the inner product 


(u,v) yp = (v, Pt), (7.5.35) 


on V satisfies the requirements of Theorem 7.5.17. The proof is based on three 
lemmas. The fourth lemma restates the theorem in a modified form. 


Lemma 7.5.19 There exists an inner product (-,-)y on W such that p(SO(V)) C 
SO(W) and p(A*) = p(A)* forall A € sl(V). 
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Proof The proof follows the argument in [17, §3]. Assume without loss of gener- 
ality that V = IR” is equipped with the standard inner product, and that W = R” 
and that p : SL(m, R) — SL(m,R) is a Lie group homomorphism. We prove first 
that there exists a unique Lie group homomorphism p° : SL(m,C) > SL(n, C) 
(the complexification of p) such that 


P'|sion.R) = P: P(A + iB) = p(A) + ip(B) (7.5.36) 


for all A, B € sf{(m,R). Since SL(m, C) is connected, we can define p°(g) for 
g € GL(m,C) by choosing a smoth path a: [0,1] > SL(m,C) with the end- 
points w(0) = 1,, and a(1) = g, and taking 


e°(g):= BO), —- B(s)“"B(s) = po (a(s)'a(s)), BOO) =1,. (7.5.37) 


To verify that 6(1) is independent of the choice of w, one can choose a smooth 
map [0, 1]? > SL(m,C): (s,t) + a(s,t) satisfying a(0,t) = 1n, w(1,t) = g, 
define S := a~!d,a and T := a~!0,a, and define B : [0, 1]? > SL(n, C) as the so- 
lution of the initial value problem B-'d,8 = 6°(S), B(0,t) = 1. Since f° is a Lie 
algebra homomorphism and 0,S — 0,7 = [S, 7], it follows that B-'d,B = p°(T) 
and so B(1,t) is independent of t. Moreover, SL(m, C) retracts onto SU(m) by 
polar decomposition and so is simply connected by a standard homotopy argument. 
That the map p° : SL(m, C) — SL(n, C) thus defined is smooth follows from the 
smooth dependence of solutions on the parameter in a smooth family of differential 
equations. That it is a group homomorphism follows by catenation of paths, and 
that it satisfies (7.5.36) follows directly from the definition. 

Now consider the action of the compact subgroup SU(m) C SL(m, C) on the 
Hadamard manifold Q of positive definite Hermitian n x n-matrices Q of de- 
terminant one (Remark 6.5.21) by (g,Q)b p°(g)Op°(g)*. This action is by 
isometries and hence, by Cartan’s Fixed Point Theorem 6.5.6, there exists an el- 
ement Qo € Qo such that p°(g)Qop*(g)* = Qo for all g € SU(m). Differentiate 
this equation at g = 1,, to obtain 6°(B)Qo + Qop°(B)* = 0 for every skew- 
Hermitian matrix B = —B* € sl(m,C) with trace zero. Now let A € sl(m,R), 
define R:= $(A— A"), S := 4(A+A'), and take B = R and B =iS. Then 
P(R)QOo + QoP(R)' = 0 and 6(S)Qo = Qop(S)'. Hence the positive definite 
symmetric n x n-matrix QO := Re(Q ) satisfies 


6(A") = (-R +S) = OP(R+ S)'OT = QA(A)'O" 


for all A € sf{(m, R). This shows that the inner product (w, w’) := w'Q7'w’ on R"” 
satisfies the requirements of Lemma 7.5.19. Oo 


In the remainder of this subsection we will fix an inner product on W as in 
Lemma 7.5.19. Recall also that we have already chosen a nonzero vector w € W. 
The norm squared of this vector determines a function on the space of inner products 
on V (see [17, Lemma 1]). 
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Lemma 7.5.20 Define the function fy : Po(V) > R by 


fol?) = (uw, p(P wh, (7.5.38) 


for P € Po(V). Then an element P € Po(V) is a critical point of fy if and only 
if (p(A)w, p(P~!)w)w = 0 for all A € sl(V). Moreover, if P is a critical point 
of fw, then the inner product (7.5.35) on V is (p, w)-symmetric. 


Proof Fix an element P € Po(V) and a tangent vector Pe T,Po(V). Then it 
follows from Lemma 7.5.19 that 


df(P)P 


-(w.p(P'P)p(P“w) 


(PP), o(P='w), 


Now let A € sI(V) and take P := AP + PA* to obtain 
dfy(P)(AP + PA*) = —(6(A + PA*P7')w, p(P~')u),, 
= —2(6(A)w, p(P~")u),. 


Thus P is acritical point of /, if and only if the right hand side of (7.5.39) vanishes 
for all A € sI(V). To prove the last assertion, define the norm 


(7.5.39) 


|w'lw.p = V(w', p(P!)w’)w 
for w’ € W. Now let P € Po(g) be a critical point of f,, let & e sl(V), and 
take A := [E, PE*P—"] € sf(V) in (7.5.39). Then 
0 = (A([§, PE*P™'/)w, p(P™')w)y = [6(PE Puli. p — 1bE)wliv.p- 


If € € gy, then 6(€)w = 0, hence 6(P&* P~')w = 0, and hence P&*P-! € gy. 
But the endomorphism P&* P~! is the adjoint of € with respect to the inner prod- 
uct (7.5.35) on V and this proves Lemma 7.5.20. Oo 


Example 7.5.21 This example shows that the (o, w)-symmetry of the inner prod- 
uct (7.5.35) does not imply that P is a critical point of f,. Take W = V x V and 
let p : SL(V) — SL(W) be the diagonal action. Assume dim(V) = 2 and choose 
w = (u,v) € W such that u, v € V are linearly independent. Then G,, = {1} and so 
every inner product on V is (p, w)-symmetric (and the assertions of Theorem 7.5.17 
are satisfied), however, the function f,,(P) = (u, P~!u)y + (v, P7!v)y does not 
have any critical point. 


Let My C Po(V) be the set of minima of the function f,, : Po(V) > R in 
Lemma 7.5.20 and let Crit(fi,.) C Po(V) be its set of critical points. The next result 
shows that f, is convex and that G,, acts transitively on M,, whenever this set is 
nonempty (see [17, Lemma 2]). 
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Lemma 7.5.22 The function f, has the following properties. 


(i) fw is convex and G,-invariant, and thus M,, = Crit( fw). 

(ii) Fix two elements Py € My, and P € Po(V). Then P € M, if and only if there 
exists an element n € Qw such that n = Pon*P,' and exp(n) = PP,", or 
equivalently p(PP,'!)w =w. 

(iii) The group Gy acts transitively on My. 


Proof We prove part (i). Let y : R — Po(V) be a geodesic and define 
P:=y(0), P:=y(0), S:= Po/?P pl? 
Then, by Lemma 6.5.18, y(t) = P'/? exp(tS) P!/? and hence 


fol (t)) = (o(P~"””)w, exp(—tA(S))p(P~/?)w) w. 


This implies 


d2 
qe ro) = (p(S)p(P~'/?)w, exp(—t6(S))H(S)p(P~"")w) w = 0 


for all t and hence f,, is convex. Hence M,, = Crit(f,,) by Lemma 7.5.9. That f,, 
is G,,-invariant follows directly from the definition. This proves (i). 

We prove part (ii). If 7 € gy satisfies n = Pon*P)!, exp(n) = PP, |, then 
p(PP,')w = w. If p(PP,')w = w, then p(P~!)w = p(P,')w, hence 


(p(A)w, p(P™')w)w = (6(A)w, p(Py')w)w = 0 


for all A e sl(V), and hence P € M,, by Lemma 7.5.20 and part (i). Now as- 
sume P € M,, and let y : [0,1] > Po(V) be the unique geodesic with the end- 
points y(0) = Po and y(1) = P € M,. Then y(t) € M,, for all ¢ by Lemma 7.5.9. 
Hence (6(n)w, p(y(t)')w) w = 0 for all ¢ and all n € s{(V), by Lemma 7.5.20. 
Differentiate this equation at ¢ = 0 to obtain 


((n)w, p(Py!)6(P Py !)w)w =0, =P := 70) € Tr, Po(V). 


Take 7 := PP5! to obtain 6(7)w = 0, and thus 7 € g, and n = Pon* P)'. By 
Lemma 6.5.18 we also have P = y(1) = exp(P P5!) Po = exp(n) Po and this 
proves (ii). 

We prove part (ii). Let Po, P € M,, choose an element n € g,, as in (1i) so 
that 7Po = Pon* and P = exp(n)Po, and define h := exp(y/2) € Gy to ob- 
tain P = hPoh*. This proves (iii) and Lemma 7.5.22. O 


With these preparations we are ready to prove Theorem 7.5.17. 
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Lemma 7.5.23 


(i) IfGy acts irreducibly on V, then M, # 9. 
(ii) If P € My, then the inner product {-, P~'-)y on V satisfies all the requirements 
of Theorem 7.5.17. 


Proof The function f, is convex and G,,-invariant by Lemma 7.5.22. Hence 
part (i) follows from Theorem 7.5.12. To prove part (ii), assume that P is a crit- 
ical point of f,. Then, by Lemma 7.5.20, the inner product (-, P~!-)y is (p, w)- 
symmetric. We prove the remaining assertions in four steps. Define 


Kp := G, NSO(V,(-, P7!)y) = {ue G, |uPu* Po! = 1}. 


Step 1 Let K C G, be any compact subgroup. Then there exists an element h € Gy 
such that h~!Kh C Kp. 


By Theorem 7.5.12, there exists a Py € Po(V) such that f,(Po) = inf f,, and 
uPou* = Po for all u € K. By Lemma 7.5.22, there exists an element h € G, 
such that Pp = hPh*. Hence uhPh*u* = hPh* for all u € K, and hence the au- 
tomorphism h~!uh is orthogonal with respect to the inner product (-, P~!-)y for 
every u € K. 


Step 2 Kp is a compact connected subgroup of Gy. 


By the Closed Subgroup Theorem 2.5.27 Kp is a closed, and hence compact, sub- 
group of SO(V, (-, P~!-)y) and so is a compact Lie subgroup of G,,. We prove that 
Kp is connected. Let u € Kp C Gy. Since G,, is connected, there exists a smooth 
path g : [0,1] > G, such that g(0) = 1 and g(1) = wu. Thus, by Exercise 7.5.16, 
we have g(t) Pg(t)* P| € G, for all t. Hence, by part (ii) of Lemma 7.5.22, there 
exists a smooth path 7 : [0,1] > gy such that y(t) = Pn(t)* Po!, exp(n(t)) = 
g(t)Pg(t)*P7!, and (0) = n(1) = 0. Hence u(t) := exp(—n(t)/2) g(t) is a path 
in Kp joining u(0) = 1 tou(1) = u. 


Step 3 Kp is a maximal compact subgroup of Gy. 


Let K C G, be a compact subgroup containing Kp. Then by Step | there exists 
an h € Gy, such that h-'Kh C Kp. Hence Kp C K C hKph™! and so Lie(Kp) C 
Lie(K) C hLie(Kp)h7!. Since Lie(Kp ) and hLie(Kp )h7! have the same dimension, 
this implies Lie(Kp) = hLie(Kp)h7!. Since Kp and hKph! are connected, this 
implies Kp = hKph7! and so Kp = K. 


Step 4 Let K be a maximal compact subgroup of Gy. Then there exists an ele- 
ment h € G, such that K = hK pho". 
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By Step 1 there exists an h € G,, such that h~'Kh C Kp, thus K C hKp/h“! and 
so K = hKph=!, because K is a maximal compact subgroup of G,,. This proves 
Step 4, Lemma 7.5.23, and Theorem 7.5.17. O 


Lemma 7.5.23 shows that all minima of the function f,, in Lemma 7.5.20 give 
rise to inner products that satisfy the requirements of Theorem 7.5.17. The next 
lemma shows that the set of minima of /,, is a Hadamard manifold. 


Lemma 7.5.24 The set M,, is a geodesically convex and totally geodesic subman- 
ifold of Po(V ). Hence, if it is nonempty, it is a Hadamard manifold and a symmetric 
space. 


Proof By Lemma7.5.22 the function /,, is convex, and so M,, is geodesically con- 
vex by Lemma 7.5.9. Now assume M,, is nonempty and fix any element Py € My. 
Then the exponential map 


Tp, Po(V) > Po(V) : P + exp(P P5') Po 


is a diffeomorphism (Theorem 6.5.10 and Lemma 6.5.18) and M,, is the im- 
age of the linear subspace {P € Tp, Po(g) | PP; ' © q,,} under this diffeomor- 
phism (Lemma 7.5.22). Since Py can be chosen to be any element of M,,, this 
shows that M,, is a totally geodesic submanifold of Po(V). Moreover, the isometry 
Po(V) > Po(V): P+ bo(P) = Py P~! Po in Step 2 of the proof of Theorem 6.5.10 
satisfies 


bo(exp(P Pi') Po) = exp(—P P. Py 


for all P € TpPo(V) and so restricts to an isometry of M,,. Hence M, is a sym- 
metric space and this proves Lemma 7.5.24. Ey] 


We emphasize that Lemma 7.5.24 does not require the hypothesis that the 
group G,, acts irreducibly on V. This hypothesis was only used to prove that the 
space M,, is nonempty. The next example shows that f,, can have critical points in 
cases where G,, acts reducibly on V. 


Example 7.5.25 Let V = R?, let W = 8 C R* be the space of symmetric ma- 
trices, equipped with the standard inner product and the standard action S +> gSg" 
of SL(2, R), and let w = S := diag(1,—1) € S so that 


a b 
boa 
The action of Gg on R? is reducible, because the diagonal in R? is Gs-invariant, 
however, the function fs(P) = trace(SP~!SP') attains its minimum on the set 


o>a@—r= i} 
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Ms = Gs C P(R7), corresponding to the symmetric inner products on R?. If 
one modifies this example by taking W = 8 x § and w = (1,8), then f,,(P) = 
trace(P~? + SP-!SP—') has a unique critical point at P = 1, the group G,, = {1} 
acts reducibly on V, and the assertions of Theorem 7.5.17 are trivially satisfied for 
every inner product on V. 


Example 7.5.21 and Example 7.5.25 show that, in general, one cannot expect 
there to be a one-to-one correspondence between the minima of /,, and the (p, w)- 
symmetric inner products on V. However, we shall see below that such a one-to-one 
correspondence does exist in many cases. 


Remark 7.5.26 Once it is known, that the function f, : Po(V) > R has a 
critical point Py € Po(V), we can modify the entire setup as follows. Replace 
the inner product on V by (-,-)yo := (, Pj ':)y and the inner product on W 
by (-.-)wo := (,e(P5')-)w. This pair of inner products satisfies the require- 
ments of Lemma 7.5.19. Let Poo(V) be the space of self-adjoint positive definite 
endomorphisms with respect to the new inner product and define the func- 
tion fo : Poo(V) > R by the analogous formula. Then P € Po(V) if and only 
if PP)! € Poo(V) and fuo(PPo) = fw(P) forall P € Po(V). Thus fo attains 
its minimum at P = 1. 


In the next corollary we do not assume that G,, acts irreducibly on V. 
Corollary 7.5.27 (Cartan Decomposition) Assume the function f,, in Lemma 
7.5.20 has a critical point at Py = 1 and define 

Ky := Gy NSO(V), Pw i= {ne qul|n=n*}. (7.5.40) 
Then the map 

Ky X Pu > Gy : (U,n) + exp(n)u =: dy (u.n) (7.5.41) 
is a diffeomorphism. Hence the map Gy — Po(V) : g + ./gg* descends to a dif- 
feomorphism from the quotient space Gy/Ky to My = Gy Po(V). 
Proof By part (ii) of Lemma 7.5.22 with Po = 1 the function /, attains its mini- 
mum on the set M,, = Gy M Po(q). Now define the map 
Vw? Gy > Ky X Pw 
by Ww(g) := (u, 7), where 
ni= 7exp (gg")E py, w= exp(—n)g € Ky 


for g € G,. This map is well defined and smooth because, for every g € Gy, 
we have gg* €G,MPo(V) = M, (see Exercise 7.5.16), and the exponen- 
tial map descends to a diffeomorphism exp : py — M, (see Lemma 7.5.22). 
Since @y ° Wy = id and Wy, o dy = id, it follows that ¢, is a diffeomorphism. 
This proves Corollary 7.5.27. Oo 


7.5 Convex Functions on Hadamard Manifolds* 383 


As a warmup for the main application of Theorem 7.5.17 in Sect. 7.6 it may be 


useful to consider the following two examples. 


Exercise 7.5.28 


(i) 


(ii) 


Let V be an m-dimensional real vector space and 
W=S*V* 


be the vector space of all symmetric bilinear forms Q0 : V x V > R. De- 
fine the homomorphism p: SL(V) — SL(W) by the standard action of the 
group SL(V) on W, ie. 


(p(g)Q)(v, v’) = O(g7'v, gv’) 


for g € SL(V), O € S?V*, and v, v' € V. Assume that V is equipped with an 
inner product and an orthonormal basis e;,..., é;,, and define an inner product 
on W by (Q,Q') := 97; ; O(e, e;)Q' (ei, e;) for Q, Q' € S?V*. Show that 
this inner product is independent of the choice of the orthonormal basis and 
satisfies the requirements of Lemma 7.5.19. 

The inner product on V is an element Qo € W whose isotropy subgroup is 
the special orthogonal group SO(V ). Show that the function fo, in (7.5.38) is 
given by fo,(P) = trace(P*) for P € Po(V) and that it has a unique critical 
point at P = 1. 


(iii) Examine the case where V = R’*! is equipped with the standard inner product 


and Q € W is the quadratic form in (6.4.9). Relate this example to the isometry 
group of hyperbolic space (Sect. 6.4.4). Find a maximal compact subgroup of 
the identity component of O(m, 1). 


Exercise 7.5.29 


(i) 


Let V be a 2n-dimensional real vector space and 
W = A’V* 


be the vector space of all skew-symmetric bilinear forms t : V x V —> R. De- 
fine the homomorphism p : SL(V) — SL(W) by the standard action of the 
group SL(V) on W, i.e. 


(o(g)t)(v, v') = (gat) (v, v') = t(g'v, gv’) 


for g € SL(V), t € A*V*, and v,v’ € V. Assume that V is equipped with 
an inner product and an orthonormal basis e),...,@2,, and define an inner 
product on W by (1, 17’) := ae t(e;,e;)t'(e:,e;) for t,t’ € A?V*. Show 
that this inner product is independent of the choice of the orthonormal basis 
and satisfies the requirements of Lemma 7.5.19. 


384 


(ii) 
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Let wo: V x V — R be a nondegenerate skew-symmetric bilinear form. Then 
the pair (V, w) is called a symplectic vector space and @ is called a symplectic 
form on V. The isotropy subgroup of w in GL(V) is called the symplectic 
linear group. Denote this group and its Lie algebra by 


Sp(V, w) := {g € GL(V) | o(g-, g-) = of, 
sp(V,@) := Lie(Sp(V,w)) = {A € End(V) | @(A-,-) + @(, A-) = 0}. 


The group Sp(V, @) is connected and contained in SL(V) (see [49]). An au- 
tomorphism J : V + V is called a linear complex structure iff J* = —1. 
A linear complex structure J is called compatible with @ iff the bilin- 
ear form w(-, J-) is an inner product on V. An inner product (-,-) on V is 
called compatible with @ iff there exists a linear complex structure J such 
that w(-, J-) = (-,-). Prove that an inner product on V is compatible with w if 


and only if it satisfies the conditions (for any basis e},..., @2, of V) 
det(w(e;, e;)) = det((e;, e;)), (7.5.42) 
A€ sp(V,a) => A* € sp(V,a). (7.5.43) 


If an inner product on V is compatible with w, prove that g € Sp(V,@) 
implies g* € Sp(V,q@) (without using the fact that Sp(V,q@) is connected). 
Hint: Define J € End(V) by w(, J-) := (-,-) and show that J + J* = 0. 
Show that A € sp(V,q@) if and only if AJ + JA* = 0. Use (7.5.43) to prove 
that J? commutes with every self-adjoint endomorphism of V and hence 
satisfies J? = Al for some A € R. Use (7.5.42) to conclude that A = —1. 


(iii) Fix an inner product on V and a symplectic form w : V x V —> R that satis- 


fies (7.5.42). Then the function f,, in (7.5.38) is given by 


fo(P) = >> we, e;)o(Pe;, Pej), P EP (V), (7.5.44) 
ij 


where e},...,@2,) is an orthonormal basis of V. Prove that this is the norm 
squared of w with respect to the inner product (-, P~!-). Prove that P is a crit- 
ical point of f,, if and only if the inner product (-, P~!-) is compatible with w. 
Prove that the space J(V,@) of w-compatible linear complex structures is a 
Hadamard manifold and a symmetric space (Exercise 6.5.24). Prove that, if 
J €J(V, ), then the unitary group U(V, w, J):= {g €Sp(V, w) | gJg t= J} 
is a maximal compact subgroup of Sp(V, w) and that every compact subgroup 
of Sp(V,@) is conjugate to a subgroup of U(V,@, J). All this is of course 
well known, but this exercise shows how these results can be derived from 
Theorem 7.5.17. Moreover, it is not necessary to assume that Sp(V, w) is con- 
nected. One can start with the identity component of Sp(V, @), prove that it is 
contained in SL(V), and deduce the connectivity of Sp(V, @) from that of the 
unitary group. 
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7.6 Semisimple Lie Algebras* 


This section discusses applications of the results in Sect. 7.5.3 to Lie algebra the- 
ory, following the work of Donaldson [17]. It examines symmetric inner products 
on Lie algebras (Sect. 7.6.1), establishes their existence on simple Lie algebras 
(Sect. 7.6.2), and derives as consequences several standard results in Lie algebra 
theory, such as the uniqueness of maximal compact subgroups up to conjugation 
for semisimple Lie algebras (Sect. 7.6.3), and Cartan’s theorem about the compact 
real form of a semisimple complex Lie algebra (Sect. 7.6.4). 

Here are some basic definitions that will be used throughout this section. Let gq be 
a finite-dimensional real Lie algebra (Definition 2.4.22). A vector subspace  C g 
is called an ideal iff [€, 7] € h for every & € g and every 7 € bh. The Lie algebra g 
is called abelian iff the Lie bracket vanishes. It is called simple iff it is not abelian 
and does not contain any ideal other than h = {0} and h = g. Examples of ideals 
in any Lie algebra g are the center Z(q) (Exercise 2.5.34) and the commutant [q, g] 
(Exercise 5.2.24), defined by 


Z(g) = {& € g|[E. 7] = 0 forall 7 € g}, 

[g. 9] := span{[§. n]|&. € g}. 
Recall also the definition of the adjoint representation ad : g — Der(g) in Exam- 
ple 2.5.23 by ad(&) := [&,-] for & € g and the definition of the Killing form x : 
g x g > R in Example 5.2.25 by «(&,) := trace(ad(&)ad(n)) for —,n € g. 
Let Auto(q) be the connected component of the identity in the group Aut(g) of 


automorphisms of g. This is a Lie subgroup of GL(q) whose Lie algebra is the 
space Lie(Auto(q)) = Der(g) of derivations on gq. 


7.6.1 Symmetric Inner Products 
In [17] Donaldson introduced the following notion. 


Definition 7.6.1 Let g be a finite-dimensional real Lie algebra. An inner prod- 
uct (-,-) on g is called symmetric iff it satisfies the condition 


6 € Der(g) => 5* € Der(g). (7.6.1) 


Here 6* : g — g denotes the adjoint of the endomorphism 6 with respect to the 
inner product, i.e. it satisfies (&,5*n) = (6&, 7) for all €, 7 € g. 


Exercise 7.6.2 Let g be a finite-dimensional real Lie algebra equipped with a 
symmetric inner product. Prove that g € Auto(g) = > g* © Auto(q). Hint: See 
Exercise 7.5.16. 


Some consequences of the existence of a symmetric inner product are derived in 
Lemma 7.6.8 below. To begin with we discuss some examples. 
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Every vector space endomorphism of a Lie algebra g that takes values in the 
center Z(q) and vanishes on the commutant [g, g] is a derivation. Conversely, every 
derivation that takes values in Z(q) necessarily vanishes on [g, g]. The space of 
such derivations is an ideal in Der(g), denoted by 


Derz (gq) := {6 € Der(g) | im(8) C Z(g)}. (7.6.2) 


Example 7.6.3 Consider the abelian Lie algebra g = R”. The adjoint representa- 
tion is trivial and Der(q) = gl(m, R) = R’”””” is the Lie algebra of all vector space 
endomorphisms of g. Thus Der(q) = Derz(q), the Killing form on g vanishes, and 
every inner product on q is symmetric. 


Example 7.6.4 Consider the Lie algebra g = gl(m,R) with [g, g] = sl(m,R) 
and Z(q) = R1. It satisfies g = [g, gq] 6 Z(q) and Der(q) = ad(q) ® Derz(qg), 
where Derz(g) C Der(q) is the one-dimensional subspace generated by the deriva- 
tion 67(A) = trace(A)1 (whose trace is m). Moreover, the standard inner prod- 
uct (A, A’) = trace(A'A’) on g is symmetric and the kernel of the Killing form 
K(A, A’) = 2mtrace(AA’) — 2trace(A)trace(A’) is Z(g). 


Example 7.6.5 Consider the Lie algebra g = gl(m, R) x R” with the Lie bracket 
[(A, v), (A’, v')] := ([A, A], Av! — A'v). This Lie algebra can be identified with 
the space of all affine vector fields on R”. It has a trivial center and the com- 
mutant [g, g] = sl(m, IR) x R’” has codimension one. Moreover, trace(ad(A, v)) = 
trace(A) for every (A, v) € g, the kernel of the Killing form «((A, v), (A’, v’)) = 
(2m + 1)trace(AA’) — 2trace(A)trace(A’) is the abelian ideal {0} x IR”, and the 
adjoint representation ad : gq — Der(q) is a Lie algebra isomorphism. 


Example 7.6.6 Consider the Heisenberg algebra h = V x R of a symplectic 
vector space (V,w) with the Lie bracket [(v,f), (v’, t’)] = (0, @(v, v’)) (see Ex- 
ercise 2.5.15). It satisfies Z(h) = [5, h] = {0} x R and the Killing form vanishes. 
Every derivation on h has the form 6(v,t) = (Av +Av, A(v) + 2At), where A € R, 
A e€V*,and A € sp(V,@). The subspace ad(h) = Derz() consists of all deriva- 
tions of the form é(v,t) = (0, A(v)). 


Example 7.6.7 The Heisenberg algebra of a symplectic vector space (V, @) ex- 
tends to a Lie algebra g = sp(V,@) x V x R with the Lie bracket 


[(A, v,t), (A’, v', t)] = ([A, A], Av’ — A’v, a(v, v’)). 
It satisfies [g, g] = g and has a one-dimensional center Z(q) = {0} x {0} x R. The 
kernel of the Killing form is the Heisenberg algebra and the adjoint representa- 


tion ad : g > Der(g) is surjective. 


The next lemma shows that the Lie algebras in Examples 7.6.5, 7.6.6, and 7.6.7 
do not admit symmetric inner products. 
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Lemma 7.6.8 Let g be a finite-dimensional real Lie algebra equipped with a sym- 
metric inner product. Then 


Der(g) = ad(g) © Derz(q), g = [9,9] © Z(g), (7.6.3) 


the kernel of the Killing form k : g x g — R is the center of g, and there exists an 
involution g — g : & + &* such that, for all €, € q, 


ad(é") = ad(§)", [§. mJ” = [n", &°]. (7.6.4) 


Proof Consider the orthogonal decomposition 
Der(g) = A OB, A := ad(q), R= Ar, (7.6.5) 


with respect to the inner product (6, 6’) = trace(6*6’) for 5,6’ € Der(q). Since 
[5, ad(€)] = ad(5€) for 6 € Der(q) and & € g, the subspace A is an ideal in Der(q). 
Moreover, if 6 € B and ¢ € Der(q), then e* € Der(g), hence trace([e, 6]*ad(&)) = 
trace(5*[e*, ad(E)]) = trace(6*ad(e*&)) = 0 for all € € g, and hence [e, 6] € B. Thus 
B is also an ideal in Der(q). 

Next define Derz(q)* := {5* | 5 € Derz(q)}. We prove that 


B = Derz(g) = Derz(g)*. (7.6.6) 


Since A and B are ideals we have [6, 6’] = 0 for all 6 € B and 5’ € A. Thus ad(6£) = 
[5, ad(€)] = 0 for all 6 € B and & € g, hence im(5) C Z(q) for all 5 € B, and so 
B C Derz(q). Now let 6 € Derz(g)*. Then 6* € Derz(q), hence [g, g] C ker(6*), 
hence trace(6*ad(&)) = 0 for all € € g, and so 6 € B. Thus Derz(g)* C BC Derz(g) 
and so (7.6.6) holds for dimensional reasons. The first equation in (7.6.3) follows 
directly from (7.6.5) and (7.6.6). It follows also from (7.6.6) that trace(6*ad(&)*) = 
trace(ad(£)5) = 0 for all € € g and all 5 € B, and so ad(&)* € Bt = A forallE eg. 
Thus 


SEA = S* 6A. (7.6.7) 


By (7.6.7), an element ¢ € g belongs to Z(qg) if and only if ad(&)*¢ = 0 for 
all € € g if and only if (¢, ad(€)n) = 0 for all €,7 € g, if and only if ¢ € [g, g]+. 
Thus [g, g]+ = Z(q) and this proves the second equation in (7.6.3). 

By (7.6.3), the map ad: g — Der(q) restricts to a Lie algebra isomorphism 
from [g, g] to A. Hence, by (7.6.7) there exists a unique involution [g, g] — [g, g] : 
— +> &* that satisfies (7.6.4) for all €, 7 € [g, g]. By (7.6.3) this involution extends 
uniquely to an involution g > q: & b> &* such that ¢* = ¢ for all € € Z(q), and the 
extended involution satisfies (7.6.4) for all €,n € q. 

Now let ¢ € g belong to the kernel of the Killing form, ie. «(€,&) = 0 for 
all € € g. Then |ad(¢)|? = «(¢, 6*) = O and hence ¢ € Z(q). Conversely, it follows 
directly from the definitions that Z(q) is contained in the kernel of the Killing form 
and this proves Lemma 7.6.8. Oo 
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The goal of this section is to establish the existence of symmetric inner products 
on simple Lie algebras. First, the following lemma derives some immediate conse- 
quences of the definition of a simple Lie algebra. 


Lemma 7.6.9 Let g be a finite-dimensional simple real Lie algebra. Then the 
center of g is trivial, the adjoint representation ad : g — Der(q) is injective, the 
commutant is [g, g] = g, and trace(ad(&)) = 0 for all E € g. 


Proof The center Z(q) is an ideal in q. It is not equal to g because g is not abelian, 
and hence Z(g) = {0} because g is simple. The subspace [g, g] is also an ideal in g. 
It is nonzero because g is not abelian, and hence [g, g] = g because g is simple. The 
adjoint representation ad : g — Der(g) is injective because its kernel is the center 
of g. The last assertion follows from the fact that [g, g] = q and trace(ad([é ; n))) — 
trace([ad(&), ad(n)]) = 0 for all €, 7 € g. This proves Lemma 7.6.9. oO 


That the Killing form of a simple Lie algebra is nondegenerate is a deeper result 
that does not follow directly from the definition. In [17, Theorem 1] Donaldson 
deduces the existence of symmetric inner products on simple Lie algebras from 
Theorem 7.5.17 and derives as corollaries various standard results in Lie algebra 
theory, including nondegeneracy of the Killing form. 


Theorem 7.6.10 Let g be a finite-dimensional simple real Lie algebra. Then the 
Killing form on g is nondegenerate, the adjoint representation ad : g — Der(g) is 
bijective, every derivation 6 : ¢ — q has trace zero, and every automorphism in the 
identity component Autg(g) has determinant one. 


In particular, Theorem 7.6.10 establishes for every simple Lie algebra g the exis- 
tence of a connected Lie group Autg(g) C SL(q) whose Lie algebra is isomorphic 
to gq. 


Theorem 7.6.11 (Donaldson) Every finite-dimensional simple real Lie algebra 
admits a symmetric inner product. Moreover, if SO(g) is the special orthogonal 
group associated to a symmetric inner product on q, then 


K := Auto(g) N SO(g) (7.6.8) 
is connected and is a maximal compact subgroup of Auto(g), every compact sub- 


group of Auto(q) is conjugate to a Lie subgroup of K, and every maximal compact 
subgroup of Auto(g) is conjugate to K. 
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Given an inner product (-,-) on any m-dimensional Lie algebra g and an or- 
thonormal basis e1,...,@m of g, define the function fg : Po(g) > R by 


fa(P) = ¥ (ler), P Pe, Pe) (7.6.9) 


i,j 


for P € Po(g). The right hand side of (7.6.9) is independent of the choice of the 
orthonormal basis and is the norm squared of the Lie bracket with respect to the 
inner product (-, P~!-) on g. 


Theorem 7.6.12 (Donaldson) Let g be a finite-dimensional simple real Lie alge- 
bra equipped with a symmetric inner product. Then the set 


My, := {P € Po(g) | dfg(P) = o} = {P € Po(g) | fa(P) = inf fy} 
= Po(g) NM Aut(g) = {exp(5) | 8 € Der(g), 6 = 8*} (7.6.10) 
a iP € Po(g) | the inner product (-, P~'-) is symmetric} 


of critical points of fg is a geodesically convex and totally geodesic submanifold of 
Po(g). Hence it is a Hadamard manifold and a symmetric space. 


The proofs require three preparatory lemmas. We do not assume that every 
derivation has trace zero. Thus it is necessary as an intermediate step to intro- 
duce the subspace Derp(q) := {5 € Der(g) | trace(S) = 0}. We will consider inner 
products on q that satisfy the condition 


6 € Dero(g) => 5* € Dero(g). (7.6.11) 


Lemma 7.6.13 Let g be a finite-dimensional real Lie algebra satisfying the condi- 
tions Z(g) = {0} and [g, g] = g, and fix an inner product (-,-) on g. Then the inner 
product is symmetric if and only if it satsfies (7.6.11). Moreover, if such an inner 
product exists, then Der(q) = ad(g) = Dero(g). 


Proof Assume (7.6.11) and consider the decomposition Derp(q) = Ap ® Bo, 
where Ag := ad(g) C Dero(g) (because [g,g] = g) and Bo := Aj are ideals 
in Dero(g) as in the proof of Lemma 7.6.8. Then ad(5&) = [6, ad(€)] = 0 for 
all 5 € Bo and all & € g. Since Z(g) = {0}, this implies By = 0, and hence the 
adjoint representation ad : g — Dero(q) is bijective. By (7.6.11), this implies the 
existence of an involution € +> &* such that ad(&*) = ad(&)* for all & € g. Since 
the adjoint representation ad : gq — Der(q) is injective, this in turn implies that the 
Killing form is nondegenerate and so Der(q) = ad(g) = Dero(g) by Lemma 7.4.3. 
Thus the inner product on g is symmetric. Conversely, if the inner product on g 
is symmetric, then it satisfies (7.6.11) because 6 and 6* have the same trace. This 
proves Lemma 7.6.13. Ey 
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Lemma 7.6.14 Let g be a finite-dimensional simple real Lie algebra with a sym- 
metric inner product, and let g > g : & + &* be the unique involution that satis- 
fies (7.6.4). Then there exists a constant c > 0 such that 


K(E*,n) =cl&,n) — forall&,n eg. (7.6.12) 


Proof By Lemma 7.6.9 the adjoint representation ad : g — Der(g) is injective. 
Hence the map g x g > R: (&, 7) tb «(&*, ) = trace(ad(€)*ad(n)) is an inner 
product on g. Thus there exists a self-adjoint positive definite vector space iso- 
morphism A : g — g such that «(&*, 7) = (AE, n) for all &, 7 € g. Letc > 0 be an 
eigenvalue of A and define } := {n eg | An = en}. We prove that is an ideal in g. 
Let € € g and 7 € §. Then, for all ¢ € q, 


(A[E, nm]. 6) = «(LE n]*. 0) = «([n*, €°1, 0) = «(n", [E*, 6) = (An, [E*, €1) 
= (An, ad(&")¢) = (ad(§) An, ¢) = ([E, An], ) = c([. 7]. 6). 


Hence A[E, 7] = c[E, n] and so [E, n] € §. This shows that h is a nonzero ideal and 
hence h = gq. Thus A = cl and this proves Lemma 7.6.14. O 


Given any inner product on gq, call an element P € Po(q) symmetric iff the 
inner product (-, P~!-) on g is symmetric. Thus every symmetric element P € P(g) 
determines an involution tp : Der(q) > Der(g) given by tp (8) := P5* P~'. Denote 
its determinant by ep := det(tp) € {—1, +1}. 


Lemma 7.6.15 Let g be a finite-dimensional simple real Lie algebra, choose 
any inner product on g, and let P, Po € Po(g) be symmetric elements. Then 
Pr, € Aut(g). 


Proof The composition of the involutions tp and tp, is the Lie algebra auto- 
morphism tp © tp,(8) = P(Pod* P5!)" P-!| = PP; 15PoP~! for & € Der(g). By 
Lemma 7.6.8 and Lemma 7.6.9 the very existence of a symmetric inner product 
on g implies that the adjoint representation ad : g — Der(g) is a Lie algebra iso- 
morphism. Hence there exists a g € Aut(q) such that 


ad(g&) = PP) 'ad(€) PoP! for all € € g. (7.6.13) 
Since ad(g&) = gad(&)g™!, this automorphism satisfies the equations 


gPP.'E.m=(E.g 'PPy nl]  forallé,n eg, (7.6.14) 


det(g) = epep, € {-1, +]}. (7.6.15) 


Since Po is symmetric, it follows from Lemma 7.6.8 and Lemma 7.6.14 that there 
exists an involution g > g : & + &* and aconstant c > 0 such that 


ad(&*) = Poad(&)*Py', —«(&*, n) = c(&, Po''n) (7.6.16) 
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for all €&, 7 € gq. By (7.6.13) and (7.6.16) we have 


«((g&)*,n) = trace(Poad(gé)* Py 'ad(7)) 

= trace((PP, !ad(&) Po P~!)* Py !ad(n) Po) 

= trace(P~! Poad(€)* Py ' PPy 'ad(n) Po) 

= trace(ad(é*) P Py 'ad(n) Po P~') 

= trace(ad(&*)ad(gn)) 

= «(&*, gn). 
This shows that g is self-adjoint and positive definite with respect to the inner 
product (-, P)!-), and so det(g) = 1 by (7.6.15). Since PP,! is self-adjoint and 
positive definite with respect to the same inner product, the vector space isomor- 
phism g PP, lg? (p a a a ' 9-1/2) ¢!/2 has only positive real eigenvalues. 
Let A > 0 be one such eigenvalue. Then the eigenspace ker(Al — g-!PP,') is a 
nonzero ideal in g by (7.6.14), and so is equal to g, because g is simple. Thus 


ao PP = AI, and since g, P, Po all have determinant one, it follows that A = 1 
and so PP, | = g € Aut(q). This proves Lemma 7.6.15. O 


Proof of Theorems 7.6.10, 7.6.11, and 7.6.12 We use the results of Sect. 7.5.3 
in the situation where V := g is the Lie algebra itself and W := Ag* @ g is the 
space of all skew-symmetric bilinear maps t : g x g > q. The Lie group homo- 
morphism p : SL(q) — SL(W) is given by the standard action of the group SL(q) 
on W, i.e. 


(p(g)t)(E,n) = gt(g'é.g 'n) 


for g € SL(g), t € W, and &, n € gq. Fix an inner product (-, -) on g and an orthonor- 
mal basis é},..., @m of g, and define 


(o,t)w := > (o(e;.e;). t(6.e;)) (7.6.17) 


i 


for o,t € W. This inner product satisfies the requirements of Lemma 7.5.19, 
i.e. p(A*) = p(A)* for all A € sI(q). The vector w := [-,-] € W is chosen to be the 
Lie bracket. This vector is nonzero because gq is not abelian. The isotropy subgroup 
of w is the group Aut(q) M SL(g) of all automorphisms of determinant one. Denote 
its identity component by G C Auto(g) N SL(g). This is a Lie subgroup of SL(q) 
with the Lie algebra Lie(G) = Dero(g). 


Claim 1 G acts irreducibly on q. 
Let h C g be a subspace that is invariant under the action of G. Then 6h C 6 for 


every 6 € Dero(q). By Lemma 7.6.9 this implies ad(€)h C § for all € € g, so § is 
an ideal. Thus h = {0} or h = g because g is simple. 
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Claim 2 f, is convex and G-invariant and has a critical point. Moreover, every 
critical point P_ € Po(g) is symmetric, and Dero(g) = Der(g). 


In the present setting the function /,, in Lemma 7.5.20 agrees with the function fg 
in (7.6.9) and G,, = G. Hence fg is convex and G-invariant by Lemma 7.5.22, and G 
acts irreducibly on g by Claim 1. Thus Lemma 7.5.23 asserts that fg has a critical 
point. Let P € Po(qg) be a critical point of fg. Then by Lemma 7.5.23 the inner 
product (-, P~!-) satisfies (7.6.11) and so, by Lemma 7.6.13, this inner product is 
symmetric and Der(q) = Dero(g). 


Claim 3 Fix a critical point Py € Po(q) of fg and any element P € Po(q). Then 
the following are equivalent. 


(a) P is acritical point of fg. 

(b) fa(P) = inf fy. 

(c) PP5' € Aut(g). 

(d) There exists a & € Der(g) such that 6 = Pod* P,', exp(5) = PP,'. 
(e) The inner product (-, P~-) on g is symmeric. 


Since fg is convex by Claim 2, the equivalence of (a) and (b) follows from 
Lemma 7.5.9. Since Der(q) = Dero(q) by Claim 2, the equivalence of (b), (c), 
and (d) follows from part (ii) of Lemma 7.5.22. Moreover, (a) implies (e) by 
Claim 2, and (e) implies (c) by Lemma 7.6.15. This proves Claim 3. 

The existence of a symmetric inner product on gq was proved in Claim 2. Thus 
the nondegeneracy of the Killing form follows from Lemma 7.6.8. The remain- 
ing assertions of Theorem 7.6.10 are direct consequences of the nondegeneracy 
of the Killing form. In particular, the adjoint representation ad: g — Der(g) 
is bijective and Der(g) C sl(g) by Lemma 7.4.3. Hence Auto(g) C SL(qg) and 
so Gy = G = Auto(q) in the notation of Sect. 7.5.3. 

Now fix a symmetric inner product on g. Then Pp = 1 is a critical point 
of fg by “(e) => (a)” in Claim 3. Hence the assertions about the subgroup 
K = Auto(q) MN SO(q) in Theorem 7.6.11 follow from Lemma 7.5.23. The equal- 
ities in (7.6.10) follow from the equivalence of (a), (b), (c), (d), (e) in Claim 3 
with Po = 1, and the remaining assertions about the space Mg in Theorem 7.6.12 
follow from Lemma 7.5.24. This completes the proof of Theorems 7.6.10, 7.6.11, 
and 7.6.12. O 


7.6.3 Semisimple Lie Algebras 
The following theorem characterizes semisimple Lie algebras in terms of symmetric 
inner products. First observe that, if h is an ideal in a finite-dimensional real Lie 


algebra g, then the subspace 


bh! := {n' € g| k(n, 1’) = 0 for all n € 5} (7.6.18) 
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is also an ideal, because every 7’ € b’ satisfies x(n, [&, ’]) = «([n, €], n’) = 0 for 
all € € gq and all 7 € b, and so [€, 7'] € bh’ forall E € g. 


Theorem 7.6.16 (Semisimple Lie algebras) Let g be a finite-dimensional real 
Lie algebra. Then the following are equivalent. 


G) g has a trivial center and admits a symmetric inner product. 

(ii) The Killing form k : g x g — R is nondegenerate. 

(iii) If C g is an ideal, thong =h@ 4%’. 

(iv) g is a direct sum of simple ideals. 

(v) There exists an inner product (-,-) on g, an involution g > g: & H &*, anda 
constant c > 0 such that, for all €, € q, 


ad(§) = ad(&)", [E.ml" = [n",€*],  «(E*,n) =c(E,n). (7.6.19) 


Definition 7.6.17 A real Lie algebra g is called semisimple iff it is finite- 
dimensional and satisfies the equivalent conditions in Theorem 7.6.16. 


Proof of Theorem 7.6.16 ‘That (i) implies (ii) was shown in Lemma 7.6.8. 
We prove that (ii) is equivalent to (iii). Assume first that the Killing form is 
nondegenerate and let h C g be an ideal. Then 


vel; eh = [n, '] = 0, (7.6.20) 


because «([n, 1’], &) = «(y, [n’, €]) = 0 for all € Eg, all 7 © b, and all 7’ € §. 
Now let 7€6M 6’. Then 7! := ad(&)ad(n)é = [&,[n, C]] € b’ for all &,f eg. 
Hence, by (7.6.20) we have (ad(€)ad(n))*¢ = [€, [n, n']] = 0 for all €,¢ € g. Thus 
(ad(E)ad(n))* = 0 and so x(E,n) = trace(ad(&)ad(n)) = 0 for all € € g. This 
implies 7 = 0 by nondegeneracy of the Killing form. Thus )  b’ = {0} and so 
g = 6 @ ’ because dim(h) + dim(h’) = dim(q). This shows that (ii) implies (iii). 
To prove the converse, take = g so that h’ = {0} is the kernel of the Killing form. 

It follows from (ii) and (iii) that, for every ideal h C gq, the Killing forms of 
and h’ are both nondegenerate. Hence an induction argument shows that (iii) im- 
plies (iv). 

We prove that (iv) implies (v). Assume that g = qi ®--- @ gq; is a direct sum of 
simple ideals g; C g. Fix an index j € {1,...,7} and denote by x; : gj x gj > R 
the Killing form of g;. By Theorem 7.6.11 the Lie algebra g; admits a sym- 
metric inner product (-,-);. Hence, by Lemma 7.6.8 there exists an involu- 
tion gj > gj : & + &* that satisfies (7.6.4), and by Lemma 7.6.14 there exists a 
constant c; > 0 such that «;(&*, n) =; (&, 7); for all €, 7 € q;. Thus the inner prod- 
uct (, 9) := D0; cj (§&}, mj); for &},nj € gj and§ = & +---+&,.n =m t+-+-+ 0 
satisfies the requirements of part (v) with c = 1 and &* := &f +--+ + &*. 

If (v) holds, then the Killing form is nondegenerate, hence Z(q) = 0 and 
Der(g) = ad(g) by Lemma 7.4.3, and hence the inner product in (v) is symmetric. 
Thus (v) implies (i) and this proves Theorem 7.6.16. Oo 
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Corollary 7.6.18 (Cartan Involution) Let g be a nonzero semisimple real Lie 
algebra equipped with a symmetric inner product and let & +> &* be the involution 
in Lemma 7.6.8. Then the map 


g>g:&b—é& (7.6.21) 


is a Lie algebra homomorphism (called a Cartan involution). The Cartan involu- 
tion (7.6.21) gives rise to a splitting 


g=fOp, F:={EeglE+E*=0}, p:={neg|n=n*}, (7.6.22) 
such that 
[f,f] c &, [f.p] Cp, Ip, p] CP. (7.6.23) 


Moreover, € is nontrivial, the Killing form k : g x g — R is negative definite on € 
and positive definite on p, and k(—,n) = 0 for all E € € and all n € p. 


Proof That the map (7.6.21) is a Lie algebra homomorphism follows directly 
from (7.6.4). It follows also from (7.6.4) that the subspaces , p C q in (7.6.22) 
satisfy (7.6.23). That g is the direct sum of these subspaces, follows from the iden- 
tity ¢** = ¢ for € € g, which implies 


C=E+n, Ere EE-S)ek, niHs(o+S)ep. 


By (7.6.23) the summand f must be nontrivial, because g is nonzero and hence is 
not abelian. 
Next observe that the formula (A, B) := trace(A* B) defines an inner product 


on End(q) with the norm | A] := ,/trace(A* A), and that 
K(&,n*) = K(&*,n) = trace(ad(&)*ad(n)) = (ad(&), ad(n)). 


For € € € and n € p this implies «(&,£) = —|ad(€)|*, «(n,n) = |ad(n)|?, and 
K(E, n) = 0. This proves Corollary 7.6.18. Oo 


Lemma 7.6.19 Let g be a semisimple real Lie algebra equipped with a symmetric 
inner product, let § t+ &* be the involution in Lemma 7.6.8, let g = g; ®--- ® gy 
be a decomposition into simple ideals g;, and let #, p C q be as in Corollary 7.6.18. 
Then the following holds. 


(i) The simple ideals g; C g are pairwise orthogonal. 

(ii) Each ideal q; is invariant under the involution € > &*. 

(iii) p is the orthogonal complement of f. 

(iv) For each j the restriction of the inner product to gj is symmetric and g; = 
£; @ pj, where €; = £€N gj, pj = pq; are as in Corollary 7.6.18. 
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Proof We prove part (i). For each i the orthogonal complement ar is an ideal, be- 
cause ([&, 7]. ¢) = (n. [&*, €]) = 0 for all € € g, 1 € gj, ¢ € gi, and so [&, n] € 97 
for all & € g, n € g}. This implies that hj c= iz gj is an ideal, and so is the 
subspace h; 1 g;. So either h; 1g; = {0} or 5; = g;, because g; is simple. 
If 6; Ag; = {O}, then [&, y] = 0 for all € € g; and all n € b;, hence g; C Z(q), 
and this is impossible because the center of g is trivial. Thus h; = g; for all 7 and 
this proves (i). 

We prove part (ii). The subspace g7 := {7*|7 € gj} is an ideal, because 
[§. n*] = [n. &*]* € gF for all € € g and all 7 € g;. By part (i) we have ([&, 7*], ¢) = 
—(&, [n, €]) =0 for all & € g; and 7 € g; withi # j, and all ¢ € g. Hence [g;, g7] =0 
fori # j. Hence the ideal g7  g; cannot be zero, because otherwise [g;, g;] = 0 
and so g; C Z(q). Hence g; = g;, because g; is simple. This proves (ii). 

We prove part (iii). By (ii) and Lemma 7.6.14 the involution & +» &* preserves 
the inner product on g;, and so by (i) it preserves the inner product on all of g. 
Hence (&,7) = (&*,n*) = —(&,) for all € € £ and all 7 € p. Thus f and p are 
orthogonal to each other and this proves (iii). 

Part (iv) follows directly from (ii) and this proves Lemma 7.6.19. Oo 


Theorem 7.6.20 Let g be an m-dimensional real Lie algebra that is not abelian 
and fix an inner product on g and an orthonormal basis e,,...,€m of g. Then the 
following are equivalent. 


(i) P = lis acritical point of fy. 
(ii) There exists a real number c such that 


m 


> (2ad¢e )*ad(e;) — ad(e;)ad(e:)*) =cl. (7.6.24) 


i=1 


(iii) g is semisimple, the inner product is symmetric, and there exists an involu- 
tion g > @: &  &* andaconstant c > 0 such that (7.6.19) holds. 


Proof By Lemma 7.5.20 the element P = 1 € Po(q) is a critical point of the func- 
tion fg in (7.6.9) if and only if, for all A € sI(q), 


0= Y° (le. ej]. Ale;. ej] — [Aer. e;] — le, Aej]) 
ijel 
—  trace(ad(e;)" Aad(e:) - 2ad(e;)*ad(e:)A). 


i=1 


This holds if and only if there exists a constant c € R that satisfies (7.6.24). Thus 
we have proved that (i) is equivalent to (ii). 

We prove that (i) and (ii) imply (iii). Since P = 1 is a critical point of fg, 
Lemma 7.5.20 asserts that the inner product on q is symmetric. Thus by 
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Lemma 7.6.8 there exists an involution g > gq: &+> &* that satisfies (7.6.4). 
Hence by (ii) there exists a real number c such that 


m 


0,:= > (2ad(e?)ad(e;) . ad(e;)ad(e})) =cl. (7.6.25) 


i=1 


Since g is not abelian, the endomorphism Qg, has a positive trace, so c > 0. 
This implies that the center of g is trivial, because Z(q) C ker(Q,). Hence, by 
Lemma 7.6.8, the Killing form on g is nondegenerate. By Lemma 7.6.19 this im- 
plies that the decomposition g = £ @ p in Corollary 7.6.18 is orthogonal. Hence the 
orthonormal basis e€1,..., @) of g can be chosen such that e;,..., ex is a basis of f 
and €¢41,...,@m is a basis of p. Thus e* = —e; fori < k and e* = e; fori > k, 
and so it follows from (7.6.25) that 


m 


y\ ad(e7)ad(e;) = cl. (7.6.26) 


i=1 


Hence «(E*,9) = Dy(e;.ad(E*)ad(ne:) = 2, (E.ad(e*)ad(e;)n) = c(&.n) for 
all €&, 1 € q. This proves (iii). 

That (iii) implies (11) follows by reversing this argument. By (iii) the Killing 
form is nondegenerate and the inner product is symmetric and is preserved by the 
involution & ++ &*. Thus the splitting gq = f @ p is orthogonal, and hence the or- 
thonormal basis e1,..., @ of g can be chosen such that e;,...,e, is a basis of f 
and €,41,...,@m is a basis of p. Moreover, 


m m 


Y“(E, ad(e# ad(e;)n) = } (e;, ad(E*)ad(ne;) = «(E*.n) = c(E.n) 


i=] i=1 


for all &, € g by (7.6.19). This implies (7.6.26). Since ef = +e; for all i, equa- 
tion (7.6.24) follows from (7.6.26). This proves Theorem 7.6.20. Oo 


Corollary 7.6.21 (Cartan Decomposition) Let g be a semisimple real Lie algebra 
equipped with a symmetric inner product, let g > g : § +> &* be the involution in 
Lemma 7.6.8, and define 


K := Auto(qg) N SO(q), Der*(g) := {6 € Der(g) | 6 = 8°}. 
Then the following holds. 


(i) K is connected and is a maximal compact subgroup of Auto(q), every compact 
subgroup of Auto(g) is conjugate in Auto(g) to a Lie subgroup of K, and every 
maximal compact subgroup of Auto(g) is conjugate to K. 

(ii) The map 


K x Der*(q) > Auto(q) : (u, 8) H exp(d)u 


is a diffeomorphism. 
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(iii) If there exists ac > 0 such that k(&*,n) = c(&,n) for all &,n € g, then 
Mg, := Crit( fg) = Po(g) N Aut(g) = {exp(S) | 6 € Der* (g)} 


is a totally geodesic and geodesically convex submanifold of Po(g) and so is a 
Hadamard manifold and a symmetric space, and the map 


Auto(g) > Po(g): gh Vgeg* 


descends to a diffeomorphism from the quotient space Auto(q)/K to Mg. 


Proof By Theorem 7.6.16 there exists a splitting g = g; ®---@ gq, into simple 
ideals, this splitting is preserved by every derivation of g, and by Lemma 7.6.19 
it is also preserved by the involution € +> &*. Thus Auto(q) is isomorphic to the 
product of the groups Auto(q;), and K = Auto(q) M SO(q) is isomorphic to the 
product of the subgroups K; := Auto(q;) M SO(q;). Hence part (i) follows from 
Theorem 7.6.11. Moreover, by Lemma 7.6.19, we have p = p; ®:-: ® p,;, and so 
part (ii) follows from Corollary 7.5.27. 

Under the assumptions of (iii) Theorem 7.6.20 asserts that Po = 1 is a criti- 
cal point of fg, so (iii) follows from Lemma 7.5.22, Lemma 7.5.24, and Corol- 
lary 7.5.27. This proves Corollary 7.6.21. O 


Remark 7.6.22 The Lie algebra of the group K = Autg(q) MN SO(g) in Corol- 
lary 7.6.21 is given by Lie(K) = {ad(&) | & € £} (see Corollary 7.6.18). If the sum- 
mand p in (7.6.22) is trivial, then Auto(g) = K is a compact Lie group. If p is 
nontrivial, then the quotient space Auty(q)/K is a nontrivial Hadamard manifold 
diffeomorphic to p. 


Remark 7.6.23 One can replace the space Po(g) of positive definite self-adjoint 
vector space isomorphisms P : g — g of determinant one by the space 1H, of in- 
ner products on qg with a fixed determinant (Remark 6.5.11). This eliminates the 
dependence on the background inner product and there is then only one function 
Tg: HCg — R whose set of minima is the totally geodesic submanifold Mg C Hg of 
all symmetric inner products on g with a fixed determinant that satisfy part (i11) of 
Theorem 7.6.20. The main result asserts that, when g is not abelian, the space Mg 
is nonempty if and only if g is semisimple (Theorem 7.6.16 and Theorem 7.6.20). 


7.6.4 Complex Lie Algebras 


A complex Lie algebra is a complex vector space g equipped with a Lie bracket 
gxg—>q: (&,n) & [E, n] that is complex bilinear, i.e. it is a skew-symmetric 
bilinear map that satisfies the Jacobi identity and 


[i§. n] = [&.in] = il. n] 


for all &,7 € q. Thus every complex Lie algebra is also a real Lie algebra. 
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Let g be a finite-dimensional complex Lie algebra. A complex ideal in g is a 
complex linear subspace } C g that satisfies [&, 7] € h for all € € g and all n € b. 
The complex Lie algebra g is called simple iff it is not abelian and has no com- 
plex ideals other than h = {0} and h = g. It is called semisimple iff it is finite- 
dimensional and the complex Killing form x° : g x g — C, defined by «°(E, ):= 
trace‘ (ad(&)ad()) for &, 7 € g, is nondegenerate. 

Since k = 2Rex‘, a complex Lie algebra is semisimple if and only if it is 
semisimple as a real Lie algebra. The next lemma shows that the analogous asser- 
tion holds for simple complex Lie algebras (see [17, Lemma 7]). 


Lemma 7.6.24 A finite-dimensional complex Lie algebra g is simple if and only if 
it is simple as a real Lie algebra. 


Proof Let gq be a simple complex Lie algebra of complex dimension n and 
let h C g be a real linear subspace of g that satisfies [E, 7] € § for all € € g and 
all 7 € h. Then the subspaces 


5 ib, 5 + ib 
are complex ideals in gq, and their real dimensions satisfy the equation 
dim® (6 0 ib) + dim® (h + ih) = 2 dim®(h). 


Since both summands on the left are either 0 or 2n, the real dimension of § is 
either 0, 1, or 2n. We claim that the dimension cannot be n. 
Assume, by contradiction, that dim® () =n. Then 


h Nib = {0}, h+ib=g. 
Hence, for all ¢, ¢’ € g there exist &, € such that ¢ = & + in, and so 
[6.67] = [&. 0] + [nif] = i([&, -10'] + In. 6']) € 6 Nib = {0}. 


This contradicts the fact that g is not abelian. Thus h = {0} or 5 = g, and this 
proves Lemma 7.6.24. Oo 


Lemma 7.6.25 Let g be a semisimple complex Lie algebra. Then g is a direct sum 
of simple complex ideals. 


Proof Let Cg beareal ideal and let h’ C g be as in (7.6.18). Then it follows from 
“(i) => (iii)” in Theorem 7.6.16 that [h + ib, h’] = {0}. Hence h + ih C h” = h 
and so ih = h. Thus every real ideal in g is a complex ideal. Hence the assertion 
follows from “(ii) => (iv)” in Theorem 7.6.16. Oo 


The next result is a theorem of Cartan [13] which asserts that every semisim- 
ple complex Lie algebra has a compact real form. The proof given here is due to 
Donaldson [17, Lemma 8]. 
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Theorem 7.6.26 (Cartan) Let q be a nonzero semisimple complex Lie algebra 
equipped with a symmetric inner product. Then 


p=, g=Ffoif, (7.6.27) 


where £, p are as in Corollary 7.6.18. Moreover, the group Auto(g) is the complex- 
ification of the maximal compact subgroup K = Auto(g) MN SO(Qg), ie. 


Auto(g) = {exp(id)u | u € K, 6 € Lie(K)} (7.6.28) 
and the map K x Lie(K) — Auto(q) : (u, 5) + exp(id)u is a diffeomorphism. 


Proof Assume first that g is simple. Then g is simple as a real Lie algebra 
(Lemma 7.6.24), and so has a nondegenerate Killing form (Theorem 7.6.10). By 
the inclusions in (7.6.23) the subspace 


h := (€ Nip) + PNif) = (EN ip) +i Nip) 


is a complex ideal in g. Hence it is either {0} or g. If h = g, then it follows 
from (7.6.22) that p = if. Assume, by contradiction, that h = {0}. Then 


£ Nip = {0}, g =f Oip. 
Hence the map o : g — gq, defined by 


o(& + in) := & —in 


for € € € and 7 € p is a Lie algebra homomorphism and an involution. This 
implies ad(o(¢)) = oad(¢)o and so x(o(f),0(6')) = K(€, 0’) for all 6,6’ eg. 
Hence «(&,in) = O for all & € £ and all 7 € p. Thus ip is the orthogonal comple- 
ment of € with respect to the Killing form. Thus, by Corollary 7.6.18, 


p = ip. 


However, for all 1 € p we have « (in, in) = —k (n,n) = —|ad(n)|?. Hence the Killing 
form is negative definite on ip and positive definite on p, and hence p = {0}. This 
implies £ = if. Since the Killing form is positive definite on if and negative definite 
on f, this is a contradiction. This contradiction shows that our assumption h = {0} 
must have been wrong. Thus h = g and hence p = if. This completes the proof 
of (7.6.27) in the simple case. 

For general semisimple complex Lie algebras, the proof of the equations 
in (7.6.27) reduces to the simple case by Lemma 7.6.19 and Lemma 7.6.25. Equa- 
tion (7.6.28) follows directly from (7.6.27) and Corollary 7.6.21. This proves 
Theorem 7.6.26. O 
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Remark 7.6.27 Let g be a semisimple complex Lie algebra. Then Aut(q) is a 
complex Lie group, i.e. it admits the structure of a complex manifold such that the 
structure maps 


GxG>G: (hg) hg, G>G:grg! 


are holomorphic. The proof uses the fact that Der(q) is isomorphic to g and that the 
resulting almost complex structure on Aut(g) is integrable (as it is preserved by the 
torsion-free connection gq"! Vg = 4(g~'g) + [g-!g, g-!g]). Theorem 7.6.26 as- 
serts that the identity component G = Auto(q) of the group of automorphisms of g 
is the complexification of the maximal compact subgroup K = Auto(q) 9 SO(g), 
ie. its Lie algebra Der(g) = ad(q) is the complexification of the Lie algebra 
Lie(K) = ad(£) and the quotient space G/K is contractible. These conditions im- 
ply the universality property that every Lie group homomorphism p : K > G with 
values in a complex Lie group G extends to a unique holomorphic Lie group homo- 
morphism p° : G > G such that p°|x = p. Such a complexification exists for every 
compact Lie group K, whether or not it is semisimple. (For an exposition see [20, 
Appendix B].) 


The following exercise is inspired by a remark in [17, §5.1] concerning a positive 
curvature manifold that is dual to Mg. 


Exercise 7.6.28 Let g be a semisimple real Lie algebra equipped with a sym- 
metric inner product that satisfies condition (iii) in Theorem 7.6.20. Consider the 
complexified Lie algebra 


g (= 9 Gig 
with the Lie bracket 
[5,07] = [€, 8] — [n,n] + i([€, n'] + In. €'1) (7.6.29) 
and the Hermitian form 
(0.0) = (6,8) + (nn) + i((8, 0) — (1, 8) (7.6.30) 


for € = &+ ine g° and ¢’ = &’ + in! € g°. With this convention the Hermitian 
form (7.6.30) is complex anti-linear in the first variable and complex linear in the 
second variable. Prove the following. 


(a) g° is semisimple. Hint: g° has a trivial center and the real part of (7.6.30) is a 
symmetric inner product on q°. 

(b) If g is simple, then g° is simple. Hint: If h° is a complex ideal in g°, then the 
linear subspace 


h := {Re(S) |o € 5°} 


is an ideal in q. 


7.6 


(c) 
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Every real linear derivation on g° is complex linear and has complex trace zero. 


(d) The identity component Auto(q°) of the group of real linear Lie algebra auto- 


(e 


wm 


(f) 


(g) 


morphisms of g° consists of complex linear automorphisms of complex deter- 
minant one. (Complex conjugation is a Lie algebra automorphism of g° not in 
the identity component.) 

The subgroup 


K* := Autg(g°) N SUG‘) 
is connected and is a maximal compact subgroup of Auto(g°). Its Lie algebra 
Lie(K*) & £ + ip 


is the compact real form of g° (Corollary 7.6.18 and Theorem 7.6.26). 
Let d + iW: g > g° bea Lie algebra homomorphism, i.e. for all &, 7 € g, 


PlE.n] = [E.On] — [VEY], LE. n] = (PE, Yn] + (VE, On]. 
Assume @ + iW is injective and denote its image by 
C:= {GE +iWE|E € g}. 


Then the following are equivalent. 

(1) The real part of (7.6.30) restricts to a symmetric inner product on [. 

dD) &*6 4+ W*W e& Aut(q). 

(ID { is a Lagrangian subspace of g° with respect to the imaginary part 
of (7.6.30), i.e. B*W — W*h = 0. 

Hint: If d*d + W*W e Aut(g), then there exists a derivation a € Der(g) such 

that a = a* and exp(2a) = ®*@ + W*wW (Corollary 7.6.21). Prove that 


6 := exp(—a)(®* YW — W*@) exp(—a) 
is a derivation whose image is abelian. Prove that 
«(5&, dn) = 0 


for all €,n € g and deduce that 5*5 = —6* = 0. 

The space of oriented Lagrangian Lie subalgebras { C g° isomorphic to g (that 
can be joined to q by a path of such subspaces) is diffeomorphic to the quotient 
space 


Lg := K°/K, 


where K° := Auto(g°) MN SU(g°) and K := Autg(g) N SO(g). Hint: Choose the 
embedding ® + iW in (f) such that 


O*O+W*V =1, o*Y—-W*d = 0, 


and extend it to a unitary automorphism of q°. 
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(h) The space Lg in part (g) embeds as a totally geodesic submanifold of dimen- 
sion dim(L,) = dim(p) into the symmetric space U(q°)/SO(gq) of all oriented 
Lagrangian subspaces of g°. Hence Lg has nonnegative sectional curvature. 
(Hint: Example 5.2.18.) One can think of the positive curvature manifold Lg 
of all Lagrangian Lie subalgebras of g° that are isomorphic to g as dual to 
the negative curvature manifold Mg of all symmetric inner products on g that 
satisfy condition (111) in Theorem 7.6.20. 


Remark 7.6.29 The idea of minimizing the norm of the Lie bracket was the ap- 
proach to the existence of a compact real form of a simple complex Lie algebra 
suggested by Cartan [14] and carried out by Richardson [58]. One significant dif- 
ference in the method of Donaldson [17], which we follow in the section, is that 
there is no need to assume that the Killing form is nondegenerate, but that this re- 
sult emerges as a byproduct of the proof (Theorem 7.6.10). There is also no need 
to use the structure theory of Lie algebras as in the work of Weyl] [77]. Instead one 
can use the existence of a symmetric inner product as a starting point to develop the 
structure theory of Lie algebras. 


Remark 7.6.30 As pointed out by Donaldson [17], a more direct approach to 
Theorem 7.6.26 would be to carry over the entire program in the present section 
and Sect. 7.5 to the complex setting, starting in Sect. 7.5.2 with convex functions 
on the Hadamard manifold M = Q)(V) = SL(V)/SU(V) of positive definite Her- 
mitian automorphisms with determinant one of a complex vector space V equipped 
with a Hermitian inner product (Remark 6.5.20). 

In the complex Lie algebra setting with Q9(q) ~ SL(g, C)/SU(g, C) the loga- 
rithm of the function fj : Q9(q) — R is the log-norm function of Kempf and Ness 
in geometric invariant theory [20, 38]. Thus the existence of a critical point of fy 
is the polystability condition in GIT. This approach was developed by Lauret [44] 
and he proved that the polystable points are precisely the semisimple Lie algebras. 
This is the content of Theorem 7.6.20 in the complex setting. Lauret’s proof uses 
Cartan’s theorem about the compact real form of a semisimple complex Lie algebra. 

One can also deduce the theorems in the real setting from those in the com- 
plex setting by complexifying the relevant real inner product space V to obtain 
a complex vector space V° = V @iV with a Hermitian inner product, and em- 
bedding the space Po(V) & SL(V)/SO(V) as a totally geodesic submanifold 
into M(V°) = SL(V")/SU(V*). 


Appendix A: Notes 


This appendix explains some notations and standard results from first year analysis 
that are used throughout this book. 


A.1_ Maps and Functions 


The notation f : X — Y means that f is a function which assigns to every point x 
in the set X a point f(x) in the set Y. When Y = R we express this by saying that 
f isa real valued function defined on the set X and, if Y is a vector space, we may 
say that f is a vector valued function. However in general it is better to say that f 
is amap from X to Y and call the set X the source of the map and the set Y its 
target. The graph of / is the set 


graph(f) := {(x,y)e X xY|y = f(x)}. 


We always distinguish two maps with the same graph when their targets are differ- 
ent. 
Amap f : X > Y is said to be 


injective F(x) = f(x.) = x, = x2 
surjective ; iff VyeYaxeXsty = f(x) 
bijective it is both injective and surjective. 


Then 


(a) f isinjective <=> ithasaleft inverse g:Y > X (i.e.go f = idy); 
(b) f is surjective <=> it has aright inverse g:Y —> X (ie. f og = idy); 
(c) f is bijective <> it has atwo sided inverse f-!: Y > X. 


(Item (b) is the Axiom of Choice.) 
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The analogous principle holds for linear maps: if A € R”””, then the linear map 
R" > R": xb Ax is 


(a) injective <= = BA=1, forsome B € R’™”; 
(b) surjective — > AB =1,, forsome B € R™”; 
(c) bijective <=> Ais invertible (i.e. m =n and det(A) # 0). 


(Here 1, is the k x k identity matrix.) However, this principle fails completely for 
continuous maps: the map f : [0,27) > S! defined by f(@) = (cos 6, sin @) is 
continuous and bijective but its inverse is not continuous. (Here S! C R? is the unit 
circle x* + y? = 1.) 


A.2. Normal Forms 


The Fundamental Idea of Differential Calculus is that near a point x9 € U a smooth 
map f : U — V behaves like its linear approximation, i.e. 


F(x) © f (xo) + df (xo)(x — Xo). 


The Normal Form Theorem from Linear Algebra says that if A € R’”*” has rank r, 
then there are invertible matrices P € R”*” and Q € R”*” such that 


Po! AQ = 1, 0+x(n—r) ; 
OGn—r)xr O¢n—r)x(n—r) 
By the Fundamental Idea we can expect an analogous theorem for smooth maps. 


Theorem A.2.1 (Local Normal Form for Smooth Maps) Let U C R" andV C 
R” be open, x9 €U, and f :U > V be smooth. Assume that the derivative df (xo) € 
IR" has rank r. Then there is an open neighborhood Up of xo in U, an open 
neighborhood Vo of f (xo) in V, a diffeomorphism @ : U; x Uz C R’ x R", a dif- 
feomorphism w : Vo > U; x V2 CR’ X R™”, such that (xo) = (0,0), W(f (x0) = 
(0,0), and 


yi lo fop(x.y) =(x, g(x, y)) and dg(0,0) =0 
for (x,y) € Uy x Up. 


The Local Normal Form Theorem is an easy consequence of the Inverse Function 
Theorem. 
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Theorem A.2.2 (Inverse Function Theorem) Let U Cc R”", V CR”, x €U 
and f :U — V bea smooth map. If df (xo) is invertible, then (m = n and) there 
are neighborhoods Up of xo in U and Vo of f(xo) in V so that the restriction 
Fiup : Uo > Vo is a diffeomorphism. 


Here follow some other consequences of the Inverse Function Theorem. The 
terms submersion and immersion are defined in Sect. 2.6.1 and Definition 2.3.2 
of Sect. 2.3. 


Corollary A.2.3 (Submersion Theorem) When r =m the diffeomorphisms and 
W in Theorem A.2.1 may be chosen so that the local normal form is 


wo f of(x,y) =x. 


Corollary A.2.4 (Immersion Theorem) When r = n the diffeomorphisms @ and 
w in Theorem A.2.1 may be chosen so that the local normal form is 


w!o f of(x) = (x, 0). 


Corollary A.2.5 (Rank Theorem) [f the rank of df(x) =r for all x € U, then 
for every xy € U the diffeomorphisms $ and w in Theorem A.2.1 may be chosen so 
that the local normal form is 


wo f of(x) = (x1,...,%;,0,...,0). 


Corollary A.2.6 (Implicit Function Theorem) Let U Cc R” x R” be an open 
set, let F : U > R" be smooth, and let (xo, yo) € U with xo € R” and yo € R". 
Define the partial derivative dy F (xo, yo) € R"*" by 


d 
dy F (x0, yo)v := ri F (xo, Yo + tv) 
tl,=0 


t= 
for v € R". Assume that F(x, yo) = 0 and that dy F (xo, yo) is invertible. Then 
there exist neighborhoods Up of xo in R™” and Vo of yo in R" and a smooth 
map g : Uy — Vo such that 

UxVo CU, — g(x0) = Yo 
and 


F(x,y)=0 ae y = g(x) 


for x € Uj and y € Vo. 
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A.3 Euclidean Spaces 


This is the arena of Euclidean geometry; 1.e. every figure which is studied in Eu- 
clidean geometry is a subset of Euclidean space. To define it one could proceed 
axiomatically as Euclid did; one would then verify that the axioms characterized 
Euclidean space by constructing “Cartesian Coordinate Systems” which identify 
the n-dimensional Euclidean space E” with the n-dimensional numerical space R”. 
This program was carried out rigorously by Hilbert. We shall adopt the mathemat- 
ically simpler but philosophically less satisfying course of taking the characteriza- 
tion as the definition. 

We shall use three closely related spaces: n-dimensional Euclidean affine 
space E£”, n-dimensional Euclidean vector space E”, and the space R” of all 
n-tuples of real numbers. The distinction among them is a bit pedantic, especially if 
one views as the purpose of geometry the interpretation of calculations on R”. The 
purpose for distinguishing these three spaces is the same as in elementary vector 
calculus; it aids geometric intuition. Here is the precise definition. 


Definition A.3.1 An n-dimensional Euclidean vector space is a real n-dimen- 
sional vector space E” equipped with a (real valued symmetric positive definite) 
inner product E” x E” > R: (v, w) & (v, w). Ann-dimensional Euclidean affine 
space consists of a set E” and an n-dimensional Euclidean vector space E” and 
maps 

E"x E" > E": (p,q) p-4q, 

E"xE" > E":(p,v)® p+ov 


satisfying the axioms 
p+O=p, pt(v+w)=(pt+v)+w, q+(p-g=P 


forall p,q € E” and all v, w € E”. The vector p —q € E” is called the vector from 
q to p and the point p + v is called the translate of p by v. It follows easily that 
each choice of a point o € E” determines a bijection v 0 + v from E” onto E”. 
The inner product on E” equips the space E” with a metric via the formula 


Ip—gl=Vip-4.p—9), Bg EE’. 
The standard Euclidean space of dimension 1 is E” = E” = R” with the usual 


matrix algebra operations (x + y)' = x' + y', (x,y) = yo, x'y’. 


Lemma A.3.2. Any choice of an origin o € E” and an orthonormal basis e1,..., en 
for E" determines an isometric bijection: 


n 
I” = Beg eos y x" e; 


i=1 


(the inverse of which is called a Cartesian coordinate system on E"). 
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Lemma A.3.3. If E” > R": pw (x!,...,x"), (y!,..., y") are two Cartesian 
coordinate systems, the change of coordinates map has the form 


yi (p) = do ajxi(p) +0, 


j=l 
where the matrix a = (a!) € R"*" is an orthogonal matrix and v € R". 


Example A.3.4_ Any n-dimensional affine subspace of some numerical space R‘ 
(with k > n) is an example of a Euclidean space. The corresponding vector space 
E" is the unique vector subspace of R* for which: 


E’= o+E" 


for o € E”. This subspace is independent of the choice of o € E”. Note that E” con- 
tains the “preferred” point 0 while E” has no preferred point. Such spaces E” and 
E” would arise in linear algebra by taking E” to be the space of solutions of k —n 
independent inhomogeneous linear equations in k unknowns while E” is the space 
of solutions of the corresponding homogeneous equations. The correspondence be- 
tween E” and E” illustrates the mantra 


The general solution of an inhomogeneous system of linear equations is a particular solu- 
tion plus the general solution of the corresponding homogeneous linear system. 


This discussion shows that a Euclidean space E” is an n-dimensional manifold 
with its Cartesian coordinate systems whose tangent space at each point is naturally 
isomorphic to E”. Thus it is natural to introduce submanifolds of Euclidean space 
as submanifolds of E” whose tangent spaces are then linear subspaces of the asso- 
ciated vector space E”. Instead we have chosen in this book for simplicity of the 
exposition to describe manifolds as subsets of the vector space R” equipped with 
its standard inner product. 
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